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Title: Expression of alpha-macroglobulins 

FIELD OF THE INVENTION 

The present invention relates to the expression of a-macrogl obu"- 
51ins, derivatives and variants thereof, and especially the expression of the 
human o^-macroglobul in ( tta M) in an active form in mammalian cells, and the 
expression of genetically engineered variants thereof. The use of such 
recombinant or-macroglobul ins, especially recombinant ^(ra^) and variants 
is described with examples from the fields of medicine for therapeutic 
10 purposes, and the development of novel defined growth media for propagation 
of mammalian cells in culture. 

BACKGROUND OF THE INVENTION. 

BIOCHEMISTRY OF n -MACROGLOBUI.IN laM\ . 

15 The Proteinase binding glycoprotein ajt, which is synthesized in 

the liver, constitute together with the complement proteins C3, C4 and C5 a 
separate class of structurally and functionally related large plasma 
proteins. For a recent review see (Sottrup-Jensen, L. (1987) in: The Plasma 
Proteins (Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando 

20 FL). 

Apart from C5 these proteins contain an internal B-cysteinyl- 
7-glutamyl thiol ester, which enables the proteolytical ly activated forms 
of o^M, C3, and C4 to participate in characteristic covalent binding reactions 
(Sottrup-Jensen, L., et al., (1980) FEBS Lett. 121.: 275-280; Salvesen, 6.S. 

25 and Barrett, A.J., (1981) Biochem. J. 187: 695-701). The thiol ester 
structure, which in the active proteins can be slowly cleaved by a number of 
small nitrogen nucleophiles, constitutes a unique type of postsynthetic 
modification of proteins, and plays a prominent role in the biological 
properties of a^. The presence of the active thiol esters in a^l is revealed 

30 by a characteristic pattern of heat fragmentation (Harpel, P.C., et al . , 
(1979) J. Biol. Chem. 254: 8869-8878). 

Traditionally, o^M has been studied within' the context of plasma 
proteinase inhibitors, although by several criteria it is unique. Whereas 
most plasma proteinase inhibitors are monomeri"c~proteins* of roughly similar 

35 size, containing approximately 430-500 residues, ajl is a tetramer whose 180- 
kD subunits contain 1451 residues (Sottrup-Jensen et al . , (1984) J. Biol. 
Chem. 259: 8318-8327). 

Furthermore, in contrast to most other proteinase inhibitors, 
which form 1:1 complexes with serine proteinases engaging the active site 
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of the proteinase and the reactive site of the inhibitor, forms complexes 
with a broad spectrum of proteinases differing in their substrate specif i- 
city and catalytic mechanism e.g.: trypsin, leucocyte elastase, chymotrypsin, _ 
pancreatic elastase, cathepsin G, plasmin, plasma kallikrein and thrombin. 
5 The second-order rate constant for association between these 

proteinases and <g< varies by several orders of magnitude. Both 1:1 and 2:1 _ 
proteinase-^ complexes can be formed, and the disul fide-bridged dimer (360 
kD) appears to be the functional unit of or^ (Sottrup-Jensen, L. (1987) in: 
The Plasma Proteins (Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, 
lOOrlando, FL). Contrary to "classical" proteinase inhibitor complexes the 
bound proteinase is still active, especially toward small synthetic 
substrates (Sottrup-Jensen, I. (1987) in: "The Plasma Proteins" (Putnam, 
F W ed ) 2nd Ed., 5: 191-291, Academic Press, Orlando, FL). 

The mechanism of proteinase binding by or^ has been described by 
15 the "trap" (Barrett, A.J. and Starkey, P.M. (1973) Biochem. J. 133: 709- 
724), where proteolytic cleavage of a particularly exposed peptide stretch 
near the middle of the 18a-kD subunit (the "bait" region) results in a 
conformational change of the <*M tetramer, thereby entrapping the proteina- 
se The nature of the essentially irreversible proteinase complex formation 
20 with a 2 M has long remained elusive. However, recent investigations show that 
a major fraction (typically > 80-90 % of the trapped proteinase is also cova- 
lently bound through epsilon-lysyl (proteinase) -7-glutamyl (cr 2 M) bonds (Sottrup- 
Jensen, L. et at., (1981) FEBS Lett. 128: 127-132; Sand, 0. et al . , (1985) 
J. Biol. Chem. 260: 15723-15735; Pochon, F. et al . , (1987) FEBS Lett. 217: 
25 101-105). 

PHVSTm nair.fll ASPECTS OF PROTEINASF-<r.M INTE RACTIONS. 

Since the ^-proteinase complexes are rapidly cleared from the 
circulation (Ohlsson, K. (1971) Acta Physiol. Scand. 8i_: 269-272; Imber, 
30M.J. and Pizzo, S.V. (1981) J. Biol. Chem. 256: 8134-8139.) a general role 
as a "clearing vehicle" for plasma proteinases has been envisaged. 

The main physiological targets may include proteinases of the 
coagulation and fibrinolysis systems and plasma Jcallikrein, and perhaps also 
proteinases like leucocyte elastase, cathepsin 6 and collagenases and other 
35 proteinases released during cellular turnover (Sottrup-Jensen, L. and 
Birkedal -Hansen, H. (1989) J. Biol. Chem. 264: 393-401). 

Although o^M may be largely confined to the vasculature in healthy 
uninflamed tissues, the inhibitor and its proteinase complexes are found at 
near plasma levels in inflammatory exudates of rheumatoid joints and gingival 
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crevicular fluids (Tollefsen, T. and Saltved, E. (1980) "j. Periodont. Res. 
15: 96-106; Borth, W. , et al . , (1983) Ann. N. Y. Acad. Sci, 421: 377-381 >r - 
While plasma orji appear to be synthesized in the liver (Schreiber, 
G. (1987) in: "The Plasma Proteins" (Putnam, F.W., ed) 2nd Ed., 5: 294-363, 
5Academic Press, Orlando, FL.) other sites of synthesis exist. Several cell 
strains in culture have been shown to produce ajl including fibroblasts 
(Mosher, D.F., et al., (1977) J. Clin. Invest. 60: 1036-1045) and monocytes- 
/macrophages (Hovi, T., et al . , (1977) J. Exp. Med. 145: 1580-1589). 

Whereas hepatocytes and Kupffer cells of the liver are most 

10 important for clearance of o^- proteinase complexes in plasma (Davidsen, O., 
et al., (1985) Biochim. Biophys. Acta 846: 85-92), fibroblasts (Van Leuven, 
F., et al., (1979) 0. Biol. Chem. 254: 5155-5160; Mosher, D.F. and Vaheri , 
A. (1980) Biochim. Biophys. Acta 627: 113-122) and macrophages (Debanne, 
M.T., et al., (1975) Biochim. Biophys. Acta 41_1: 295-304; Kaplan, J. and 

15Nielsen, M.L. (1979) J. Biol. Chem. 254: 7323-7328) also possess receptors 
for ctjM-proteinase complexes. 

These observations suggest that there may be a considerable 
extravascular turnover of or 2 M perhaps primarily carrying proteinases 
functioning in the cellular micro environment (Sottrup-Jensen, L. and 

20Birkedal-Hansen, H. (1989) J. Biol. Chem. 264: 393-401). 

SUMMARY OF THE INVENTION 

Briefly stated, the present invention discloses a method for the 
production of recombinant a-macroglobul ins, and especially human a 2 M, and 
25 variants thereof in an active form. 

Within a preferred embodiment, the cultured host cell is an 
eukaryotic cell such as a mammalian cell or cells derived from organisms 
such as insects, plants, yeast or other fungi, such as Aspergillus . 

The invention further relates to DNA sequences comprising a gene 
30 encoding for the expression of human aj\ and variants thereof, vectors 
comprising such DNA sequences, and suitable hosts transformed with such 
vectors. 

Yet another aspect of the invention is the use of recombinant 
OjM and variants thereof as a protein carfier'Tn enzyme "replacement therapy 
35(ERT). 

Yet another aspect of the invention is the use of recombinant 
a-jM and variants thereof as a DNA carrier in gene therapy. 

Further aspects of the invention relates to the use of recom- 
binant or-macroglobulins, especially human o^M, and variants thereof as 
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constituents of growth media, either as an additive or co-expressed with a 
desired gene product. 

DEFINITIONS 

5 Prior to setting forth the invention it may be helpful for an 

understanding thereof to set forth definitions of certain terms to be used _ 
hereafter. 

Complementary DNA or cDNA: A DNA molecule or sequence which have been 
lOenzymatically synthesized from sequences present in a mRNA template. 

DNA Construct: A DNA molecule, or a clone of such a molecule, either single- 
or double- stranded, which may be isolated in partial form from a naturally 
occurring gene or which has been modified to contain segments of DNA which 
15 are combined and juxtaposed in a manner which would not otherwise exist in 
nature. 

Plasmid or Vector: A DNA construct containing genetic information which may 
provide for its replication when inserted into a host cell. A plasmid 
20 generally contains at least one gene sequence to be expressed in the host 
cell, as well as sequences encoding functions which facilitate such gene 
expression, including promoters and transcription initiation sites. It may 
be a linear or closed circular molecule. 

25 Joined: DNA sequences are said to be joined when the 5' and 3' ends of one 
sequence are attached by phosphodiester bonds to the 3' and 5' ends, 
respectively, of an adjacent sequence. Joining may be achieved by such 
methods as ligation of blunt or cohesive termini, by synthesis of joined 
sequences through cDNA cloning, or by removal of intervening sequences 

30 through a process of directed mutagenesis. 

Variant: A peptide related to the original peptide, "but wherein the amino 
acid sequence has been altered through mutation of the ^ gene encoding the 
original peptide. 

35 
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Guanine 
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Thymine (only in DNA) 
Uracil (only in RNA) 



BRIEF DESCRIPTION OF thf nPAu mn<r 

Figure la illustrates the construction of plasmid p!136 
Figure lb illustrates the construction of plasmid pi 167 
Flgure 2 illustrates the structure of plasmid P 1167 

of thp tK F l 9 r 3 illUStrat6S 3 * el electrophoresis (10 - 20 % SDS-PAGE) 
of the thermal fragmentation products generated from and r ttj M 

fr,o m ,,- Fl9Ure 4 illustrates a 9^ electrophoresis of the thermal 
fragmentation products generated from methylamine treated o,M and ra^. 
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F1g „re 5 illustrates a gel electrophoresis (SDS-PAGE, of the _ 

— riVn— a £ gel" electrophoresis of unreacted 

10Sup erose . «.,-. ^ ^ cUctrophoresis (10 20 » racing 

• » fv . nm r hvmntrvDsin treated human otaM, human 
SDS-PA6E) of the reaction products from chymotrypsin tr« 

PZP and ^-PZP. ■ jiiustrates ^ ^ electrophoresis (10 . 20 , duc ing 

j fvnm Plastase treated human a^M, human 
15 SDS-PA6E) of the reaction products from elastase xre 

PZP and ra 2 M-PZP. n 0/ re H U cinq 

Figure 12 illustrates the gel electrophore S1 s (10 ■ 20 * ™ dUC ™° 
S0S-PA6E) of the reaction products from trypsin treated human human 
a „d ^-PZP. niustrates ^ ^ eUctrophoresis (1 „ . »» reducing 

. SOS-PAGE) of the reaction products from Saptol^UJureui Glu-spec^c 
protease treated human a^, human P2P and ro,M-PZP. 

25 DEJAJLEB 0ESCMPT10H OF THF 1HVEI1TI0N 

According to the invention there .s prov,ded a process for tn 
production of o-macroglobulins, especially human 

fragments or derivatives, including variants thereof, where,n a functionally 
r« expression vector comprising a gene encoding for the express- 

30a\-macroglobulin, especially human o,-macro g l obu ,n, or fragmen 
derivatives thereof, including variants, or alleles of such , . gene is 
duced into a suitable host capable of expressing sa,d gene said hos s 
cultured in a suitable nutrient medium containing S^"^^ 
carbon and nitrogen and other essential nutr.ents and the expressed . 

35macroglobulin, especially human 0,-macroglobulin, or fragments or der,vat,ves 
thereof is recovered. 

Many proteins synthesized particularly in mammalian cells undergo 
post-translational modification (processing) of one kind or the other. 
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Depending on the final destination and on the specific function of a newly 
synthesized protein, it may go through a number of processing steps leading- 
to covalent modifications such as e.g.: glycosyl ation, 7-carboxyl ation, 6- 
hydroxylation, sulphatation, amidation, thiol ester formation, phosphory- 
Slation, proteolytic cleavage at precursor processing sites, fatty acylation 
(Rosner, M.R. (1986). in: "Mammalian Cell Technology", (Thilly, W.6. edl 
Butterworth Publishers, Stoneham, MA. : 63-89). 

Proteins of various sizes and with a variety of different post- 
-translational modifications have been successfully expressed in transformed 
10 heterologous mammalian host cells using recombinant DNA technology A few 
examples: Human coagulation factors Vila and IX have been expressed in trans- 
formed BHK (Syrian Baby Hamster Kidney) cells with correct post-translational 
modifications such as 7-carboxyl ation and glycosylate (Thim, L. et al 
(1988) Biochemistry 27: 7785-7793; Busby, S. et al., (1985) Nature 316: 271- 
15 273). Human Platelet-derived Growth Factor AB heterodimer has been expres- 
sed in transformed CHO (Chinese Hamster Ovary) cells with correct processing 
of the A and B chain precursors and correct assembly of the AB heterodimer 
Human coagulation factor VIII has been expressed in transformed CHO cells 
with correct processing of the precursor leading to a two chain molecule that 
20 can be activated by thrombin and factor Xa (Kaufman, R.J. et al . , (1988) J 
Biol. Chem. 263: 6352-6362; Pittman, D.D. and Kaufman, R.J. (1988) Proc 
Natl. Acad. Sci. USA 85: 2429-2433). 

So far, there have been no reports on the heterologous expression 
of proteins in which the formation of an active thiol ester is a prominent 
25 post-translational modification. 

The biosynthesis of the internal thiol ester in the third com- 
ponent (C3) of complement from rabbit has been investigated (Iijima, M. et 
al., (1984) J. Biochem. 96: 1539-1546). Rabbit liver mRNA was translated jn 
vitro in a rabbit reticulocyte lysate system, and the synthesized C3 specific 
30 products did not incorporate radio labelled methylamine. On the other hand 
radio labelled iodoacetamide reacted with the synthesized C3 specific 
products; these results indicated the presence in the primary C3 specific 
translation product of a free thiol group instead of a reactive thiol ester. 
If a liver homogenate supernatant (S-13) Including cytosoT and microsomes was 
35 included, the C3 specific product could now incorporate methylamine. By 
increasing the concentration of the S-13 component (s), the incorporation of 
methylamine in C3 specific products was increased, and at the same time 
incorporation of iodoacetamide decreased. If the S-13 fraction was treated 
at 65°C for 5 min, the activity was completely lost. 
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The results from this investigation strongly suggest an involve- 
ment of a transglutaminase-! ike or other type, of enzyme in the posttrans a- . 
To a formation of an active thiol ester in rabbit C3. There are no smnlar_ 
investigations addressing the formation of the thiol ester in other • a-macro - 
Sglobulins, e.g. ^ but from analogy and homology ^s^U^ vt 
expected that a similar mechanism is responsible for the formation of thiol _ 
esters in other a-macrogl obul ins synthesized in the mammalian liver. 

Through this investigation a number of developments were done 
lOwhich also are deemed to be encompassed of the present invention. These 
include DNA sequences comprising a gene encoding for the expression of «- 
macroglobulins, especially human « 2 -macroglobul in, or frag^or ^deriva- 
tives and variants thereof as exemplified in SEQ ID N0:1 and SEQ ID NO 3. 

Another aspect of the invention relates to functionally operative 
15 expression vectors comprising a gene encoding for the expression of at least 
one a-macroglobulin, especially human cr 2 -macroglobulin or fragments or 
derivatives and variants thereof, or alleles of such a gene. 

Such vectors preferably further comprise regulatory elements 
necessary for the stable maintenance of said vector in mammalian cells. 
20 Also, such vectors may further include sequences providing for 

. the processing and secretion of the expressed product. 

In relation to the use of recombinant a-macrogl obul ins, and 
especially m 2 M, in growth media it may be co-expressed with another desired 
gene product, and consequently the vectors of the invention may further 
25 comprise one or more other genes encoding for a desired gene product. 

The invention further relates to transformed hosts comprising a 
functionally operative expression vector according to the invention compri- 
sing a gene encoding for the expression of human c^-macroglobul in or fragments 

30 or derivatives and variants thereof, or alleles of such a gene. 

The host may be selected from the group comprising a bacterial 
strain, a fungal strain, a mammalian cell line, or a-mammal, especially a 
fungus, such as belonging to the genus Aspergillus , or a yeast strain, pre- 
ferabTy belonging to the genus SaccharomYce s. 

35 Another preferred type of host is a mammalian cell line, 

preferably a Syrian Baby Hamster Kidney (BHK) cell line, and especially the 
one which is available from ATCC under No. CRL 1632. 
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The invention further relates to the recombinant human o^- 
macroglobulin or a variant thereof in an active form having the amino acid, 
sequence of SEQ ID NO: 2, or SEQ ID NO: 4. 

5 APPLICATIONS OF ff -MACROGLOBULINS . ESPECIALLY raJI. 

The present invention discloses applications of a-macroglobul ins, 
and especially ro^M. These should be regarded not as limitations but as a 
few examples among many for the use of recombinant derived a-macroglobul ins. 

10 g-MACROGLORUI TNS AS CONSTITUENTS OF DEFINED GROWTH MEDIA. 

Degradation of specific heterologous products produced in either 
transformed or non-transformed mammalian cells is a potential problem in the 
production of recombinant products. This is due to the fact that many host 
cells secretes one or more different proteinases. 
15 Wh en a production cell line is grown in the presence of e.g. 10 

% fetal calf serum, such proteolytic degradation of secreted recombinant or 
native protein products is a minor problem due to a buffering effect of the 
added serum proteins. 

However, the use of fetal calf serum in the large scale growth 
20 (fermentation) of mammalian production cell lines is not a desirable 
situation for a number of reasons. First of all fetal calf serum is a very 
costly constituent of complex growth media; second, the demand for fetal 
calf serum from a growing biopharmaceutical industry might not be easily 
fulfilled in the future, and third, the use of fetal calf serum constitutes 
25a potential quality control problem in the production of pharmaceuticals 
intended for use in humans. 

To circumvent these problems, efforts can be expected in the 
field of development of defined growth media for use with mammalian cells. 

Addition of various proteinase inhibitors to such new defined 
30 growth media will be required to ensure the integrity of the secreted 
products. Alternatively, the producer cell line might, through genetic 
engineering, be endowed with the capacity to produce -and secrete proteinase 
inhibitors along with the desired product(s). 

a-Macroglobulins, and especial ly~Humari \ ~aj\; are proteinase 
35 inhibitors of broad specificity, and they are therefore according to the 
invention used as constituents of defined growth media for mammalian cells, 
either as a medium additive or as a product co-produced with the desired 
product. 
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The target sites for a number of different proteinases, e.g. 
bovine trypsin, Streptomvces ariseus trypsin, papain, porcine elastase, ~ 
bovine chymosin, bovine chymotrypsin, Staphylococcus aureus strain V8^ 
proteinase, human plasmin, bovine thrombin, thermolysin, subtilisin Novo and 
5 Streptomvces oriseus proteinase B have been mapped in the bait region of 
human a^ (Mortensen, S.B., et al., (1981) FEBS Lett. 135: 295-300) and other„ 
a-macroglobulins (Sottrup- Jensen , L . , Sand, 0., Kristensen, L. and Fey, G.H. 
J.Biol.Chem. 264,15781-15789, 1989). It is evident that a& and the other a- 
macroglobulins as proteinase inhibitors have broad specificities. 
10 In those situations, where the proteinase inhibitory spectrum 

of a a-macroglobulin, such as a^, is not sufficient for the prevention of 
product degradation, it is possible through site specific mutation, protein 
engineering, etc. to change the proteinase inhibitor specificity of the a- 
macroglobulin, such as a 2 M. Incorporation of desirable specific proteinase 
15 target sites in the bait region of recombinant a 2 M will change the inhibitor 
specificity of the mutated aj>l. Furthermore it is possible through genetic 
engineering to construct novel specific or general proteinase target sites 
in the bait region of a a-macroglobulin in order to enhance its versatility 
as a proteinase inhibitor of specific or broad inhibitory spectrum. 
20 Furthermore it is possible to remove specific target sites in an a- 
macroglobulin in order to avoid degradation of the variant in question by 
certain proteases in the circulation that will already be inhibited through 
the action of naturally present proteinase inhibitors. 

The production of recombinant products in fungi, such as species 
25 and strains of e.g. Aspergillus and Saccharomvces also meets with potential 
problems of product degradation. In some cases it is possible to isolate 
proteinase negative mutants of desirable production strains. This might not 
always be the case, and co-expression of a-macroglobulins, such as a^ or 
a^-mutants together with a desirable product may inhibit proteolysis of the 
30 product in question. 

g-MACROGLOBUI TN MUTANTS AS SPECIFIC PROTEINASE INHIBITORS. 

The amino acid sequence of the bait^ region of a-macroglobulins 
defines the specificity of the a-macroglobulin towards" di fferent proteina- 
35ses. A comparison of cleavage patterns for different proteinases and bait 
region sequences in five mammalian a-macroglobulins has recently been 
published (Sottrup-Jensen, L. , Sand, 0., Kristensen, L. and Fey, G.H. The 
a-macroglobulin bait region. Sequence diversity and localization of cleavage 
sites for proteinases in five mammalian a-macroglobulins. ,1. Biol. Chem. 264, 
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•WW-WW. 1969). It has previously been clearly demonstrated that the bait 
region ,„ eacn spec1es „ f a . mlcrogUbuU „ u determinant or 

proteinase Inhibitor specificity. The present invention demonstra tthT 

Sal e a " 1 m ° dUUt,n9 ^ ** WUr 

5 alterations of proteinase target sites in the bait region. 

of human .H 1 ," tHe / reSent inVent,on H 15 """«tr,t.d that the bait region 
of human (rescues 690 to 730 in SEQ ID N0:2, can be mutated at will to 

sen™ P t r h? inaSe T. 1 ^ Pr ° fi,e ° f The example 

loir! , h , PreS6nt ,nVenti ° n d " Cr1beS thc """ruction of a hybrid 

prote,„ ( PZ P) was )nt roduced into human from which the native bait region 
had been removed. The hybrid molecule, which was constructed by the use of 

to trie inhibitor profile of PZP. 
15 The invention thus demonstrates the possibility to design and 

produce proteinase inhibitors with altered and new inhibitor spec"" 

inhibit I" 15 f1 " d1n9 U ' mporU " t f ° r ^sign of new proteinase 
20 T t0 '° W '" U ^^y «» "ait region in macroglobulins 

20 Va Leuven, F Harynen, P., Cassiman, J.-J. and Van den Berghe, H Mappin 

nt tit [° Uti ° n ^ '» . pane, o, monoc „ 

3 , gas " a,pha - 2 — ^mmgljtethods ,„, 

Ma ynen P 3 c ela, -, E - Ta P-^taudiere, 0., Pochon.TT, 

Harynon, p., C ass,man, J. -J., van den Berghe, H. and Van Leuven, F The 

m :::: rr ation of Human A „ ImmMUct z 

" > y *" m °" 0C,0na ' ""bodies. J- Biol th,n, m , 2 g 81 . 2 , 89> 

9M ,t ,s now possible, by the use of the technology described in the 
present .nvention, to design non-immunogenic new proteinase inhibitors that 

30ses coZ t V' !!"" treatment ° f any diS6aSe ' - " r » '"ressive proteina- 
30ses constitute a threat to the health of man. 

described hv 1 ^" 6 PreSent SpeC,f1cU,on the Potion of aj, variants Is 
described by the construction of a hybrid macroglobulm. It is clear to the 

I'll PerSOn * he art that cha "S es »e obtained through other 

35No Jo' ZZT" 9 " Kth ° dS - S " Ch international Pub ication 

35NO. HO 89/06279 (NOVO INDUSTRI A/S). Also it is clear that other - 

macroglobulins could be employed instead of the human such as those 

ment,„ned in Sottrup-Jensen. t. et^ <„ 8 g,, ^ 
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,,„... ra nTEiH rflRRTFR IN F ""MF RFPI ACEMFNT THERAPY ^ 

molecules such as proteins and nucleic acids. When reacts w.th and forms 
cZlex with a proteinase in solution, ^ may bind other ^prote.ns , so 
5„o„-prote1nase proteins) present in that solution (Salvesen, G.S. et al 
8 Biochem. V 195, 453-46.). In the case of Fabry's disease, wh,ch _ 
X-chromosome linxed disorder of glycosphingolipid metabo , ,t 
recently been demonstrated that ej, can function as a carrrer n 
model of enzyme replacement therapy (ERT) (Osada, T., et al n«»* 
lOBiophys Res. Commu. 142: 100-106). «# was conjugated to coffee bean o 
alaczoiiLe through the action of trypsin, and the for^d CO.* e * _ was 
internalized through ^-receptor specific (Van Leuven, F.. et al., (1981) J. 

o . Chem. 256: 9016-9022, endocytosis and delivered to the 
U the target organelle for preceptor mediated interna ,zati . 
,5prote1nase complexes (willingham, M.C. and Pastan, I., (1980) tell 21. 

77 '" such a scheme in ERT provides a method of internalization to the 

lysosome of the enzyme in question and at the same time it might allevnate 
potent antigenicity problems arising from the use of heterologous enzym 
20 in therapy. One limitation in this type of ERT (Osada, T., et al., (1987 
. o em. P B1ophys. Res. Co-. 142: 100-106, would be the types of p< , ent 
target cells that could be treated by this protocol. Obviously, they would 

o express the .preceptor. In a future development of 
possibility might exist to redesign the cell specificity o f * ,v> ern 
25tion by exchanging the receptor binding domain of oji w,th ther ce to 
ligands. Hereby oJ4-mutants could be designed to enter any cell type 
to express a specific Internalizable receptor. 

This type of development would of course requ,re a system for 
the production of recombinant derived o^. The use of native human o* as J, 
30carr er in ERT (as described above) is undesirable due to the now we known 
risks of the employment of blood derived products in the treatment of human 

d,S " Se ' The production of recombinant in accordance with the present 
invention alleviates this problem by providing for large scale products 
35 of ra,H. 

m.H K ft QUA CA P°'f° SEME THERAPY ■ , 

Advances in gene transfer into mammalian cells have opened for 
the possibility of the treatment of a number of genetic disorders through 
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gene therapy. A major problem in gene therapy will be the specific targeting 
of genes into the appropriate cells within the body. (Williamson, B., (1982}- 
Nature 298: 416-418; Anderson, W.F., (1984) Science 226: 401-409; Parkman, 
R., (1986) Science 232: 1373-1378). — 
5 It was recently described that a constructed foreign gene 

containing the chloramphenicol acetyl transferase (CAT) on a bacterial plasmid 
could be targeted to the liver of rats by specific receptor directed 
internalization (Wu, G.Y. and Wu, C.H. (1988) J. Biol. Chem. 263: 14621- 
14624). The DNA carrier consisted of a galactose-terminal (asialo)glyco- 

10 protein and asialoorosomucoid covalently linked to poly-L-lysine. The 
polycation poly-L-lysine can bind DNA in a strong non-covalent and nondamag- 
ing interaction. It was demonstrated that complex bound DNA was internalized 
by cell -surface asialoglycoprotein receptors that are unique to hepatocytes. 
The complex was injected intravenously, and upon analysis only the liver 

15 expressed the CAT activity. 

In the present invention the use of rar^ as a carrier of DNA in 
gene therapy is suggested. Reaction of raj* with a proteinase such as trypsin 
or with methylamine in the presence of covalently closed circular plasmid DNA 
is likely to result in partial or total entrapment of DNA within the 

20complexing or 2 M molecule. After intravenous injection of such complexes with 
exposed receptor binding domains, the complex will be rapidly cleared from 
the blood and internalized in specific target cells, such as hepatocytes and 
Kupffer cells* Through protein engineering on the receptor binding domain of 
ro 2 M it will be possible to design a DNA carrier specific for other cell 

25 types. The advantage in this system as compared to the above described system 

f . using the asialoglycoprotein receptor is, that it will not be necessary to 
identify different DNA carrier systems for each new cell type. 

30 EXAMPLES 

Materials and methods: 
Microorganisms and cell lines 

E. coli K12 (MC1061) is available" from e.g. ~ Stratagene Inc., 
35 11099 North Torrey Pines Rd., La Jolla, California 92037. 

HepG2 (Human hepatoblastoma cell line) is freely available from 
American Type Culture Collection, under No. HB 8065. 

BHK (Syrian Hamster Kidney cell line, thymidine kinase mutant 
line tk*sl3, (Waechter and Baserga (1982) Proc. Natl. Acad. Sci. USA 79: 
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1106-1110); is freely available from American Type Culture Collection, under 
No. CRL 1632. 

Plasmids and vectors 

5 

Plasmids pCDVI-PL and pSP62-K2 are available from Dr. Tasuku - 
Honjo, Faculty of Medicine, Kyoto University, Kyoto 606, Japan. pSP62-K2 was 
derived from the plasmid pSP62-PL (available from New England Nuclear/Du 
Pont (U.K.) Ltd., Wedgwood Way, Stevenage, Hertfordshire, SG14QN) as 

10 described (Noma et al., (1986) Nature, 3J9: 640-646). pCDVI-PL was derived 
from pcDVl (Okayama, H. and Berg, P. (1983) Molec. cell. Biol. 3: 280-289) 
as described (Noma et al., (1986) Nature, 319: 640-646). 

M13mpl8 is available from Pharmacia LKB Biotechnology (catalog 
# 27-1552-01) (Norrander, J., Kempe, T. and Messing, J. Gene 26: 101-106, 

151983). 

M13mpl9 is available from e.g. International Biotechnologies, 
Inc., P.O. Box 9558, 275 Winchester Avenue, New Haven, Connecticut 06535, 
USA. 

pDHFR-I is available from Dr. K.L.Berkner, ZymoGenetics Inc., 
204225 Roosevelt Way NE, Seattle, Washington 98105. (The construction of this 
plasmid is given in detail in: Berkner, K.L. and Sharp, P. A. (1984) Nucleic 
Acids Res. 12: 1925-1941). The molecular cloning of the DHFR cDNA present 
in this plasmid, and its sub-cloning in mammalian expression vectors under 
the control of adenovirus derived promoters has previously been described 
25 in detail (Chang, A.C.Y., et al . , Nature 275: 617-624 and Kaufman, R.J. and 
Sharp, P. A. (1982) Mol . Cell. Biol. 2: 1304-1319) . The backbone plasmid in 
pDHFR-I is pBR322 (Sutcliffe, J.G. (1979) Cold Spring Harbor Symp. Quant. 
Biol. 43: 77-90; Sutcliffe, J.G. (1978) Nucleic. Acids Res. 5: 2721-2728). 

pUC13 is described in: Vieira, J. and Messing, J.: 1982, Gene 19: 
30 259-268 and available from Pharmacia LKB Biotechnology (catalog # 27-4954- 
01). 

pUC19 is described in: Yanisch-Perron, C. and Messing, J., 1985, 
Gene 33:103-119 and available from Pharmacia LKB Biotechnology (catalog # 
27-4951-01). 

35 



<WO 9103SS7A1J_> 



REPLACEMENTShEET 



WO 91/03557 



Growth media 
LB- broth: 
Mix 



PCT/DK90/00225 

15 



227 g Bacto Tryptone, Difco 0123-01 
113.5 g Yeast extract, Difco 0127-01, and ~~ 
5 227 g NaCl in a seal able plastic container. 

Add 12.5 g mix to 500 ml water in a 1000 ml bottle, shake well and sterilize 
in an autoclave. 

Dulbeccos Modified Eagle Medium is available from e.g. Gibco Ltd. 
10 P.O. Box 35, Trident House, Renfrew Road, Paisley PA34EF, Renfrewshire, 
Scotland. Cat.# 042-250 1M (10 * concentrate). 

Antibodies 



15 Anti-arjM A033 and peroxidase conjugated anti-a 2 M PE326 were from 

DAKOPATTS A/S, Copenhagen, Denmark. 

EXAMPLE 1. 

CLONING AND .SFQMFN CE DETERMINATION OF HUMAN r*M 
20 ~ 

Preparation of messe nger RNA from the human cell line HepjG2. 

The human hepatoblastoma cell line HepG2 (American Type Culture 
Collection No. HB 8065, freely available) was used as a source for mRNA 
preparation. HepG2 cells were grown to a total cell number of 15 * 10 7 in 
25Dulbecco's Modified Eagle medium containing 10% fetal calf serum and 
antibiotics. 

Total RNA was isolated by the guanidinium thiocyanate method 
(Chirgwin et al., (1979) Biochemistry 18: 5293-5299) and purified by CsCl 
gradient centrifugation. A total of 3000 119 RNA was obtained. mRNA was 

30 isolated by use of an ol igo(dT)-cellulose column (Aviv & Leder (1972) Proc. 
Natl. Acad. Sci. USA 69: 1408-1412). 60 ng of mRNA was obtained after one 
cycle of affinity chromatography. After ethanoV precipitation, this 
preparation of mRNA was resuspended in 10 mM Tris-HCl pH 7.5, 0.1 mM EDTA- 
Na 2 at a final concentration of 1 /ig//il and stored at' -80^C for subsequent 

35 use in the construction of a cDNA library. 

Construction of a cDNA library from HepG2 mRNA. 

A cDNA library was constructed in the pCDVI-PL/pSP62-K2 vectors 
(Noma et al., (1986) Nature, 319: 640-646. Available from Dr. Tasuku Honjo, 
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Faculty of Medicine, Kyoto University, Kyoto 606, Japan) by use of the 
methods described by Okayama & Berg (Mol. Cell. Biol. 2: 161-170 (1982); 4 
Mol. Cell. Biol. 3: 280-289 (1983)). _ 
E. coli K12 (MC1061) (Casadaban & Cohen (1980) J. Mol. Biol. 

5138: 179-207) was used for transformation. MC1061 were grown in L-broth at 
37° C to ODe^O.5. Twenty ml were centrifuged, and the pellet was resuspended- 
in 7 ml of ice-cold sterile 0.1 M CaCl 2 , incubated on ice for 30 minutes, 
centrifuged briefly, and finally kept in the cold room overnight. 

Ninety-five |il suspension of transformation-competent E. coli 

10MC1061 were added per 10 fi\ of cDNA preparation. The mixture was incubated 
on ice for 30 minutes, heat-shocked at 43,5°C for 45 seconds, and finally, 
after addition of L-broth, incubated at 37°C for 30 minutes. 

After resuspension, the cells were plated onto L-broth plates 
containing ampicillin (50 /ig/ml) and grown for 8 hrs at 37"C. A total of 2.9 

i5*10 5 individual colonies could be obtained from this library. 

Screening of the HepG2 library for cDNA clone s encoding human ot,M,. 

5 * 10* individual colonies were screened by standard colony 
hybridization technique using nitrocellulose filters (Maniatis et al . , (1982) 
20Molecular Cloning - A Laboratory Manual, Cold Spring Harbor, New York). 
A 20-mer oligonucleotide mixture 
5' CC(T/C)TTCAT(G/A)TC(T/C)TC(T/C)TG(T/C)TT 3' 
where the notation (X/Y) means that either of the nucleic acids X or Y may 
be used, complementary to the human e^M mRNA in the region encoding amino 
25 acid residues Lys-Gln-Glu-Asp-Met-Lys-Gly (residues number 493 - 499 in 
Sottrup-Jensen et al., J. Biol. Chem. 259: 8318-8327 (1984) was synthesized 
(on a DNA synthesizer from Applied Biosystems, USA), labelled with (using 
T 4 polynucleotide kinase and -y-^P-ATP) to a specific activity of 3 * 10 8 
cpm/pmol oligonucleotide. The labelled oligonucleotides were purified by gel 
30 chromatography and subsequently used in the screening of the cDNA library. 

The hybridization solution contained 6 * SSC, 5 * Denhardt's 
solution, 0.05% SDS (Maniatis et al . , (1982) Molecular Cloning - A Laboratory 
Manual, Cold Spring Harbor, New York) and_ 10'__cpm/m\ of labelled oligo- 
nucleotide mix. 

35 Hybridization was performed for 3 hrs at 45' C. Then the filters 

were washed in 6 * SSC, 0.05% SDS at 45*C for 3 * 10 minutes. After- autora- 
diography the filters were washed under the same conditions, but this time 
at 52'C. A colony that still showed hybridization at this temperature was 
isolated and the cDNA insert of the corresponding plasmid (designated po^M) 
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from this isolate was sequenced (Tabor 
Sci. USA 84: 4767-4771). The sequence 
amino acid sequence are shown in the 
N0:1:, and SEQ ID N0:2:. 



& Richardson (1987)" Proc. Natl. Acad, 
of the cDNA and the derived encoded - 
appended sequence listings, SEQ ID 



Charactpriyati pn of p »_M 

po^M had a cDNA insert of approximately 4.6 kb. Its sequence is 
given in Table I above. 

10 The se£ l u ence in Table I demonstrates that the entire coding 

region of tta M including the signal peptide is found in the insert. 

In addition to the coding region, the insert contains sequences 
derived from the 5'- and 3' untranslated regions of the fta M mRNA molecule. 

The amino ac1d sequence of the human a 2 M as deduced from the cDNA 
15 in por^ is in total agreement with the published sequence (Sottrup-Jensen et 
al., (1984) J. Biol. Chem. 259: 8318-8327). Codon number 1000 (numbered from 
the initiating methionine.codon in the signal peptide) was found to be ATC 
encoding an isoleucine and not GTC (encoding a valine) as found in an afl cDNA 
synthesized from human liver mRNA (Kan et al., (1985) Proc. Natl. Acad. Sci 
20 USA. 82: 2282-2286). In the a* cDNA sequence from the HepG2 library we have 
further identified ten silent changes as compared to the sequence from the 
Mver library, see the following Table I: 
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TABLE I 




5 


Codon 


Liver 


HepG2 




413 (Asn) 


AAC 


AAT 




495 (Phe) 


TTT 


TTC 


10 


750 (Gly) 


GG6 


GGT. 




796 (Leu) 


CTT 


CTC 


15 


835 (Leu) 


CTT 


CTA 




1266 (Ala) 


GCC 


GCA 




1296 (Asn) 


AAT 


AAC 


20 


1326 (Thr) 


ACC 


ACA 




1442 (Leu) 


CTC 


CTG 


25 


1460 (He) 


ATC 


ATT 



The position of the oligonucleotide mixture used as a hybridiza- 
tion probe in the colony screenings was from position 1574 to position 1594 
30 and the position of the reactive thiol ester is from position 2939 to 2953 
in SEQ ID N0:1. 

EXAMPLE 2. 

Construction of a mamma lian expression vector for o^,. 

35 ptt2 M was digested (fig. la) with Xbal and EcoRI, and a 1.2 kb 

fragment containing the 5' part of the o^ cDNA together with the multiple 
cloning site of pSP62-K2 was isolated on an agarose gel and cloned in an 
XbaI/£coRI digested M13mpl9 vector to generate M13mpl9A. To facilitate 
further subclonings of the cDNA, a unique EcoRV site was introduced in 

40the 1.2 kb fragment 10 nucleotides 5' to the initiating ATG (methionine) 
codon through site directed mutagenesis (Kunkel et al., (1987) Methods 
Enzymol. 154: 367-382). In the same- mutagenesis experiment, in which the 

mutagenic oligonucleotide N0R593: 

5' (TTCTTCCCCATGGTGGATATCGAAGGAGCTG) 3 ' 

45was used, the 5 nucleotides 5' to the methionine codon was changed to 
CCACCAJG; this mutation creates a new Ncol site spanning the ATG codon. A 
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correct mutant M13mpl9B was identified through restriction enzyme digestion 
and ONA sequencing. 

The mutated 5' end of o^M cDNA was isolated from M13mpl9A repli- 
cative form through digestion with Hindlll and EcoRI and agarose gel electro"- 
5 phoresis. The isolated DNA fragment was then joined to Hind I II/ EcoRI digested 
po^M through ligation to generate pll36. In this plasmid the cgi cDNA is 
reassembled in its total length, but now with a unique EcoRV site at the 5' 
end. pi 136 was digested with EcoRV/Dral, and the fragment was isolated on 
an agarose gel and cloned in a mammalian expression vector under control of 

10 the adenovirus 2 major late promoter (Ad 2 MLP). 

The adenovirus-promoter based vector was constructed by K.L.Berk- 
ner (ZymoGenetics Inc., Seattle, WA.), and a detailed description of the 
functional elements in the mammalian expression vector is given in: Powell, 
J.S. et al., (1986) Proc. Natl. Acad. Sci. USA 83: 6465-6469 and in: Boel 

15 et al., (1987) FEBS Lett. 219: 181-188). 

The expression vector used for expression of human was 
generated from the mammalian expression vector pPP (Boel, E. et al., (1987) 
FEBS Lett. 219: 181-188), in which human pancreatic polypeptide cDNA was 
cloned under control of Ad 2 MLP. 

20 P pp was digested (fig. lb) with BamHI and the resulting stag- 

gered ends were repaired with DNA polymerase (Klenow fragment and the four 
deoxynucleotide triphosphates). The 4.5 kb EcoRV/Dral o^M cDNA fragment was 
joined to this vector through ligation, and correct recombinants were 
characterized through restriction enzyme analysis on isolated miniprep. 

25plasmids. 

The a 2 M-mRNA transcribed from the resulting 8.76 kb plasmid 
(designated pll67 (fig. 2)) has the adenovirus 2 late tripartite leader (Ll- 
3) at its 5' end together with an mRNA splice signal (SS). At the 3' end of 
the construct the transcript is terminated with the SV40 late termination - 
30 and polyadenylation signal. 5' to the Ad 2 MLP the construct includes the 
SV40 enhancer (ENH) and the 0 to 1 (0 - 1) map units from adenovirus 5. 

Expression of g,M in mammalian cells. 

For expression of human o£M in cultured BHK ce'lTs (Syrian Hamster 
35 Kidney, thymidine kinase mutant line tk*sl3, (Waechter and Baserga (1982) 
Proc. Natl. Acad. Sci. USA 79: 1106-1110); American Type Culture Collection 
CRL 1632) the expression vector p!167 was co-transfected with pDHFR-I (Berk- 
ner, K.L. and Sharp, P. A. (1984) Nucleic Acids Res. 12: 1925-1941. Available 
from K.L.Berkner, ZymoGenetics Inc. Seattle) into subconfluent cells by the 
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calcium phosphate mediated transfection procedure (Graham and Van der Eb 
(1973) Virology 52: 456-467). In the transfect.ion experiment the molar ratio 
between pll67 and pDHFR-I was 10:1. Cells were grown in Dulbeccos Modified 
Eagle Medium supplemented with 10% fetal calf serum (FCS). 
5 Forty-eight hours after transfection, cells were trypsin! zed and 

diluted into medium containing 400 nM methotrexate (MTX). After 10 to 12. 
days, individual colonies were cloned out and expanded separately. The 
expanded cultures were propagated for 24 hours as described above, and 
producer clones were identified using an enzyme linked immunosorbent assays 
10(ELISA) (Munck Petersen C, et al., (1985) Scand. J. Clin. Lab. Invest. 45: 
735-740) against human a 2 M secreted to the growth medium. 

Description of the aJfi FI TSA assay. 

The materials used in the ELISA were: 
15 Catching antibody A033 anti-ar^, 

Peroxidase-conjugated anti-o^M antibody PE326, 

1, 2- Phenyl enedi amine, di hydrochloride (0PD) 

all from DAK0PATTS A/S, Copenhagen, Denmark. 

Urea peroxide, 125 mg, was from Organon Teknika. 
20 96 well ELISA plates were from NUNC, Copenhagen. 

Coating buffer: 

100 mM carbonate buffer pH 9.6 was made up as follows: 
Add 3.18 g Na 2 C0, and 5.96 g NaHC0 3 to 1000 ml water. 

25 

Standard and sample buffer: 

To 100 ml of 150 mM phosphate buffer pH 7.2 was added: 

50 fi\ Tween 20 

2 g Bovine Serum Albumin (Sigma A 7030). 

30 

Washing buffer: 

10 mM sodium phosphate pH 7.4 
145 mM sodium chloride 
0.1 % Tween 20. 

35 

Citric acid-phosphate buffer, pH 4.9: 

The following reagents were added to 1000 ml of water 
7.3 g citric acid 
23.88 g Na a HP0 4 , 12 H 2 0 
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0.5 ml Tween 20 

The buffer was used for a maximum of 14 days, stored at 4*C. 

Urea peroxide solution: — 
5 125 mg urea peroxide was dissolved in 8.93 ml water. 

The solution was kept in the dark at 4°C. 

Coating of the plates for assay: 

The 96 wel1 P late was coated with 175 /il of the DAKO A033 
10 antibody diluted 1:1000 in the coating buffer. The plate was incubated over 
night at 4'C. Before use the plate was washed 4 times in washing buffer. 

Application of standards and samples: 

100 ^ stan <*ard or sample was added to each well. As a standard 

15 purified human <*H, 2 mg/ml (prepared as described in: Sottrup-Jensen et al 
(1983) Ann. N.Y. Acad. Sci. 421: 41-60) was used. The standard curve included 
the following serial dilutions: 1:4000, 1:8000, 1:16000 etc. down to 
1:1024000, corresponding to final concentrations from 500 fig/1 down to 1 95 
mn. All dilutions were done in the Standard and sample buffer. The plate 

20was incubated over night at 4'C and then washed 4 times with wash buffer 
before the next step. 

Addition of conjugated antibody: 

100 /il of PE326 > wnic " been diluted 1:6000 in the Standard 
25 and sample buffer, was added to each well. The plate was incubated for 2 h 
at 20°C, and then washed 4 times with wash buffer. 

Enzyme activation: 

8 mg of 0PD was dissolved in 12 ml of Citric acid- phosphate 
30 buffer. To this solution 500 „1 Urea peroxide solution was added and the 
mixture was used immediately. 100 /,! of the final solution was added to each 
well, and the plate was incubated in the dark for 6 min. Then 100 /il of 2 M 
HaS0 4 was added to each well and the A^ was read in an automated EL ISA plate 
reader. - — — - • 

35 

The above described EL ISA did not give any background on medium 
supplemented with 10% FCS, nor did it give any background in BHK cell 
conditioned medium. Of 24 isolated MTX resistant clones, 16 produced 
detectable amounts of recombinant afl. 
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Selected cell lines that secreted 12.3 mg/1 (K16-6) and 19.1 
rog /l (K17-6) in the supernatant (grown in a 6 well NUNC-plate) over a 48 r 
hour period were expanded for large scale production of recombinant human _ 

5 

Purification of recombinant human ot-J-L 

Cell lines K16-6 and K17-6 were each expanded into one ten- 
double tray (NUNC, Denmark) with a growth surface of 6000 cm*. At 80% 
confluency the medium on the cells was changed from containing the 10% fetal 
10 calf serum (FCS) down to 2%. After 48 hours of growth in medium with only 2/. 
(PCS), the medium was removed, and the cells were washed twice with serum 
free medium. Cells were then grown serum free for 4 to 5 days with change of 
serum free medium every two days. Conditioned medium was pooled and analyzed 

for ra 2 M by ELISA. . ■ • 

15 * The pooled conditioned medium from K16-6 and from K17-6 contained 

7.15 mg/1 and 21.5 mg/1 of m 2 M, respectively. 

The ra 2 M was purified according to published procedures (Sottrup- 
Jensen et al., (1983) Ann. N. Y. Acad. Sci . 421: 41-60). Briefly the 
conditioned medium was loaded onto a 10 ml Zn-Chelate column (Zn- 
20 iminodiacetic acid Sepharose 4B (Porath, 0. et al . , (1975) Nature 258: 598- 
. 599) equilibrated with 25 mM Tris-HCl P H 8.0, and washed with 100 ml 
phosphate buffered saline (PBS) P H 7.2 until A^ < 0.036. A second wash with 
20 mM sodium phosphate, 500 mM NaCl P H 6.2 was performed until A^ < 0 033 
The flow rate was 100 ml/hr and 3 ml fractions were collected. ra 2 M was eluted 
25with 100 mM EDTA P H 7.0 at a flow rate of 40 ml/hr. During elution 1 ml 

fractions were collected. 

Recovery of raj\ was 44%. The r*^ containing fractions were con- 
centrated to 1 ml on an Amicon devise equipped with a PM 10 membrane and 
then loaded onto a Superose 12 gelfiltration column (25 mM Tris-HCl, 150 mM 
30 NaCl P H 8.0). The rafl containing fractions were pooled and stored at -20 
until analysis. 

EXAMPLE 3. _ ... .„ 

nharacteriration of r pcombi nant human roJL 

35 , r^^i ration, at the tMnl ester- thormal fragmentation a nd 
methvl amine induced c leavage. 
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A number of different analyses were performed to evaluate the 
structural and biological characteristics of the human r<& as compared *o- 
a preparation of human plasma derived o^M, designated preparation LSJ39. 

An important structural feature of 0a M is the presence of the 

5 thiol ester. When heated to 95'C for 15 .in, the thiol ester will induce a 
peptide bond cleavage in the backbone of fta M at the position of the thiol 
esterified Glx-residue. This results in the fragmentation of the 180 kD o^ 
monomer into two polypeptides of 120 kD and 60 kD. Fig.. 3 shows an analysis 
of both the purified ro^M (from two transformed BHK cell lines) and the 

10 purified human plasma derived preparation LSJ39 on a 10-20% SDS polyacryl- 
amide gel. The different preparations, either native human or BHK cell 
derived recombinant were all heat treated to induce thermal fragmenta- 
tion before loading onto the gel. Molecular weight markers (from top to 
bottom: 180, 120, 92, 60, 43, 26, 14 and 6 kD) were applied to lanes 1 and 

158. Samples in lanes 2, 3 and 4 were not reduced before electrophoresis, while 
samples in lanes 5, 6 and 7 were reduced. Preparation LSJ39 was applied to 
lanes 2 and 5. ro^M K16-6 was applied to lanes 3 and 6, and rc^ K17-6 was 
applied to lanes 4 and 7. 

It was clear from the patterns of protein fragments on the gel, 
20 that both human o^M and the two ra^M preparations showed a considerable degree 
of thermal fragmentation. As expected, only the reduced samples displayed 
this fragmentation. In the nonreduced samples, the molecules migrated as the 
360 kD dimer. 

In the human plasma derived preparation LSJ39 (lane 5) a fragment 
25 migrating slightly faster than the 60 kD fragment could be observed. Lanes 

6 and 7 indicated the presence in the recombinant material of a similar 
faster migrating fragment. It is possible that this fragment represented a 
slightly underglycosylated variant of the 60 kD fragment. 

Methylamine (MA) and other small nitrogen containing nucleo- 
30philes will cleave the thiol ester and thereby inactivate the ester (Sottrup- 
Jensen, L., et al., (1980) FEBS Lett. 121: 275-280; Salvesen, G.S. et al., 
(1981) Biochem. J. J95: 453-461). After MA induced inactivation of the thiol 
ester, thermal fragmentation of can no longer be observed. 

Fig. 4 shows a SDS-PAGE run similaf-to that's-howri in Fig. 3 (with 
35 respect to loaded samples), in which applied tta M and ro^M had been pretreated 
with MA. From this gel it was concluded, that the thiol ester of ro^M was just 
as susceptible to cleavage with MA as the thiol ester of native ajl. Upon 
reduction MA-treated aju and ra# migrated as a single 180 kD monomer species. 
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Lanes 5 of both Fig. 3 and 4 shoved an additional band of ^ 
approximately 85 kD. When a* is cleaved in the bait region by proteinases r 
present in the blood, it generates two fragments, each with a molecular 
weight of 85 kD. The human preparation LSJ39 (purified from serum) 

5 contained these cleavage products, while they could not be detected on this 
gel in the two m 2 M preparations. This indicated that the material secreted, 
from the transformed BHK cell lines was largely native uncomplexed or 2 M. Any 
a 2 M molecules, that have reacted with proteinases are inactivated and can 
not form additional complexes with other proteinases. Since the BHK cell 

10 does not produce any proteinases that forms complexes with the ra 2 M product, 
this cell is therefore well suited for production of recombinant human a 2 M. 

B. Reactin n with trypsin. 

Reaction with trypsin is a standard way of analyzing the proteinase-complex 
15 formation ability of ^ (Sottrup-Oensen, L. (1987) in: "The Plasma Proteins 
(Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando, FL; Harpe , 
PC (1973) J. Exp. Med. 138: 508-521; Harpel , P.C., et al . , (1979) 0. Biol. 
Chem. 254: 8869-8878; Swenson, R.P. and Howard, J.B. (1979) J. Biol. Chem. 
254' 4452-4456). In this reaction trypsin will cleave at its target site(s) 
20 in the bait region of OfjM, and the resulting reduced cleavage products (85 kD) 
will migrate as a double band. Under nonreducing conditions the trypsin-o^ 
complexes will migrate as high molecular weight products. 

Fig. 5 shows the result of such an analysis (performed as 
described (Sottrup-Jensen, L. (1987) in: "The Plasma Proteins" (Putnam, F.H., 
25 ed ) 2nd Ed., 5: 191-291, Academic Press, Orlando, FL; Harpel, P.C. (1973) 
J. Exp. Med. 138: 508-521; Harpel, P.C, et al., (1979) J. Biol. Chem. 254: 
8869-8878; Swenson, R.P. and Howard, J.B. (1979) J. Biol. Chem. 254: 4452- 
4456)) on the native human <*N preparation LSJ39 (lanes 2 and 5) and on ro^M 
from cell lines K16-6 (lanes 3 and 6) and K17-6 (lanes 4 and 7). The samples 
30 in lanes 2, 3 and 4 were not reduced before electrophoresis, while the 
samples in lanes 5, 6 and 7 were. Lane 5 shows that almost all of the human 
native was cleaved with trypsin, while the two preparations of ro^ were 
cleaved with an efficiency of approximately 80% or more. Without reduction 
of the complexes no low molecular weight products from the reaction between 
35 trypsin and the native a* or the BHK cell derived ra 2 M were seen on the gel. 
The 85 kD fragments derived from the recombinant material migrated somewhat 
faster than the human standard; as mentioned above the recombinant materi- 
al might be slightly underglycosylated. 
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When aji is reacted with methylamine, the thiol ester will be" 
inactivated, and changes conformation from the "slow" form to the "fast-" 
form (Sottrup-Jensen, L. (1987) in: The Plasma Proteins (Putnam, F.W ed ) 
2nd Ed., 5: 191-291, Academic Press, Orlando, FL; Van Leuven, F. , CassimaTT, 
5J.-J. and Van Den Berghe, H. (1981) J. Biol. Chem. 256: 9016-9022). In this 
conformation it can no longer react rapidly with or form complexes with 
proteinases such as e.g. trypsin. 

Fig. 6 shows the results of a set of experiments that were run 
in parallel to the experiments described above and shown in Fig. 5. However 
lObefore reaction with trypsin the native human o^M and the ra^M used in this 
experiment had been treated with methylamine (Sottrup-Jensen, L., et al 
(1980) FEBS Lett. 121: 275-280). Under these conditions both the native <^M 
and the ra 2 M show a marked decrease in reactivity towards trypsin (80% or 
more of the or.M and rc^M monomers were migrating as a 180 kD polypeptide) 
ISThis indicates that trypsin does not rapidly cleave at the bait region in 
methylamine treated human ajl or in BHK cell derived rajl. 

In these types of experiments BHK cell derived ra 2 M has shown 
characteristics similar to those of native human a 2 M. 

20 C. Trypsin and methylamine induce d conformational chano* in a?M . 

As mentioned above the o 2 M molecule will undergo a conformational 
change both through complex formation with proteinases and through methyl- 
amine induced cleavage of the thiol ester. The change in structure results 
m an altered mobility on rate gels (Sottrup-Jensen, L. (1987) in: The Plasma 
25 Proteins (Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando, 
FL; Van Leuven, F., Cassiman, J. -J. and Van Den Berghe, H. (1981) J. Biol 
Chem. 256: 9016-9022); unreacted a# will migrate as a "slow" form, while 
reacted aji will migrate as a "fast" form. 

Fig. 7 and Fig. 8 show these conformational changes, as they 
30 appear after reaction with trypsin and methylamine, respectively (analyzed 
on 5-10% rate gels). 

Lanes 1 on both gels contain purified -human pregnancy zone 
protein (PZP) (Sand, 0. et al . , (1985) J. Biol. Chem. 260: 15723-15735), 
which is known to appear in both - a diinerir- (O)-and- a tetrameric (T) 
35 configuration. 

Lanes 2 on both gels contain unreacted human preparation 
LSJ39. Lanes 3 on both gels show the fast migrating form, resulting from 
reaction with trypsin and methylamine, respectively. Lanes 4 on both gels 
show the unreacted raj* preparation K16-6, and lanes 5 show the corresponding 
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fast forms. Lanes 6 on both gels show the unreacted ror^ preparation K17- 
6, and lanes 7 show the corresponding fast forms. 

It can be concluded that both complex formation between ror 2 M and^ 
trypsin and reaction of raj* with methylamine result in the appearance of 
5 fast migrating structures. These structures appear (as analyzed on rate gels) 
to be very similar to the structures obtained when human ot^ was allowed to- 
react with trypsin and methylamine. It is also evident from these figures 
that the rafl proteins showed a migration, which, when- compared to the 
migration of dimeric and tetrameric PZP on the gels, is in agreement with the 
10 finding that these molecules are produced and secreted from the BHK cells in 
the active tetrameric conformation. 

n. Chromatography of or-H o n a Superose 6 column. 

A Superose 6 column can partially resolve or 2 M molecules in the 

15 dimeric configuration from molecules in the tetrameric configuration 
(Sottrup-Jensen, L. unpublished). Human standard aj\ and rcr^ was analyzed 
on a 24 ml Superose 6 column (buffer: 25 mM Tris-HCl, 125 mM NaCl pH 8.0; 
flow rate: 1 ml/min; fraction size: 1 ml). Fig. 9 shows the diagrams obtained 
from the chromatography of purified human standard or 2 M and ra 2 M from the K17- 

206 and the K16-6 BHK cell lines. Tetrameric (Sottrup-Jensen, unpublished 
observation) will elute in fraction 12 on this type of column. It is evident 
from the chromatograms that both of the ror 2 M preparations eluted in fraction 
12, as did the human standard ■ a#. On this type of column, dimeric 
molecules will elute in fraction 14 and 15 (Sottrup-Jensen, unpublished 

25 observation). This type of analysis supported the results obtained from the 
rate gels (Figs. 7 and 8), that ro^M was secreted from BHK cells in a 
tetrameric configuration. 

F. Trypsin protectio n analysis. 

30 When trypsin is trapped inside the molecule, it retains its 

catalytic capacity towards low molecular weight substrates such as S-2222 
(N-benzoyl-L-Ile-L-Glu-Gly-L-Arg-p-nitroanilide). If trypsin is efficiently 
complexed with tt2 M, it will be protected against high molecular weight 
inhibitors such as Soybean Trypsin Inhibitor (STI) (Sottrup-Jensen, L. (1987) 

35in: The Plasma Proteins (Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic 
Press, Orlando, FL; Ganrot, P.O. (1966) Clin. Chim. Acta. 14: 493-501; 
Sottrup-Jensen, L. et al., (1981) FEBS Lett. 128: 127-132). 

K16-6 and K17-6 derived ra^ was compared with human plasma a^M 
in such a protection assay. 100 /xl aJU (in 25 mM Tris-HCl, 125 mM NaCl, pH 
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8.0) was mixed with 30 ,1 trypsin (0.5 mg/ml in 20 mM sodium acetate pH 5.0) 
After .ncubating for 2 mi „. 30 ,1 1 m g/ml STI (in PBS) was added. 10 „1 al^ 
quo s were removed after 2 and 4 m in. and each m ixed with 750 ,1 0.12 mM S- 
2222 (dissolved 0.1 M sodiumphosphate pH 8.0, 5% dimethyl sulfoxide) 

results of tl he Chan9e ^ abS ° rbanCe at 405 nm was corded for 2 min. The 
results of the assay are given in the following Table II: 



TABLE II 



15 



20 



> Prep. 


of a^. 


OjM in cuvette. 


Activity. 




A^min 




A^min/zxg 


Human 


LSJ39 


0.140 


5.00 


0.028 


K16-6 




0.111 


4.62 


0.024 


K17-6 




0.119 


4.87 


0.024 



lv th„ < F T th " e reSU " S " be c ° ncll,ded that ™* "ad essential- 
ly the same protection capacity for trvDsin ™i„,i <tti 

„,„.„,.. 3 trypsin against STI as compared with the 

protection capacity of human plasma a*. 

25 the orotprt " ^ " methy,a "' i " e b ° f °™ «» Protection assay, 

T T 1 " PaCUy dr ° PS *—««•»»• * similar assay as tha 
lot t ' methyUm1ne tr " Ud hUma " > U ™ °* «t.,L 17, 

w th 1 ih" - 6 C ° nClUd6d that against STI 

^with almost the same efficiency as did human plasma o^M. 

£■ Amino terminal aminn stmumr1m] „ f ^ 

lion „ ,„ Th i e ° retica1,J '- the ^ characterized in the present in.estiga- 
lon could „„, y be either bov1ne (contamjnant frm ' 

35 a" IT, 5 T"? fTOm " e C6,,) * der1 " d f ™ «^»'« " £ 

H cen H t aSm /' 167 - EUSA "* W US6d ^"- d ^ in 

Z^l C0 " d : k t,0 " ed medium ' wh « h - ""I, or without added fetal calf serum. 

amln t 6 , 6 °* «« >>«»" and to characterize the 

am no termma, processing of the recombinant product, amino. terminal amino 

40* 1 sTT determi " ati °" »» =n out K16-6 and K.7-6 r.* ,s 

The „ ( rf S ° ttrUp - JenSe "' L - •»"•. ("84, J. Biol. Chem. 253: 829 3-8303 

detected" 1 9 J 0 " KaS repeated f ° r 12 a "<" «- < d "tity of 

detected ammo acid derivative in each cycle, was in total agreement with the 



SOOCID: <WO_9t03SS7Ai_l_» 



REPUCSMENTSWFfrr 



WO 91/03557 



PCI7DK90/00225 



28 



amino terminal sequence of human a 2 M: Ser-Val -Ser-Gly-Ly -Pro-G n Tyr Me 
Val-Leu-Val-, whereas bovine <*H has the following am.no terminal sequence. 
Ala-Val-Asp-Gly-Lys-Pro-Gln-Tyr-Met-Val-Leu-Val- (unpublished, Dr. Torsten 
Kristensen, Department of Molecular Biology, University of Aarhus, Denmark.) 



5 

FXAMPLE 4. 



r nn ^r*inn and e ynr^ion of a bait region mutant, of human ft? M 

In the present example it is demonstrated that the bait region 
of human can be substituted by the bait region of human pregnancy zone 
10 protein (PZP) (Sottrup Jensen, L. , Folkersen, J., Kristensen, T. and Tack, 
B F Partial primary structure of human pregnancy zone protein: extensive 
sequence homology with human alpha 2-macroglobul in. Pror Natl Acad Sci 
» S.A. 81, 7353-7357, 1984; Sand, 0., Folkersen, J., Westergaard, J.G. and 
Sottrup Jensen, L. Characterization of human pregnancy zone . protein 
^Comparison with human alpha 2-macroglobul in. J.Biol .Chem. 260, ^723-15735, 
1985). The resulting ^ bait region mutant exhibited a proteinase inhibitor 
profile similar to that of human pregnancy zone protein. 

To facilitate substitution of DNA fragments encoding the bait 
region of human cDNA, target sites for the restriction enzymes PstI and 
20 Sacll were introduced at the 5' and at the 3' end of the cDNA region encoding 

the bait region. . 

The human a 2 M expression plasmid P 1167 was digested with BamHI and 
Clal, and a 2660 bp fragment, which carried the central part of the human 
a 2 M cDNA, was subcloned in the BamHI and C]al digested vector pSX191. 
25 This vector, which had previously been constructed, is a 

derivative of P UC19. It was constructed as described: P UC19 was digested 
with EcoRI and HindHI, and a synthetic linker with the following sequence 

30 MTTGCTACCC^CAGGAAT^ " «gg 

CCATGGGACGTCCTTAAGTTCGAATAGCTACCGTACGCCTAGGTCGA - N0R782 

was cloned in the digested P UC19 vector. The linker, Which was an annealing 
product from the two synthetic oligonucleotides N0R781 and N0R782 has 
35 cohesive ends that will ligate to the EcoRI and the HindHI sites of P UC19 
in such a way that these ligation sites are not regenerated in the P SX19 
vector. Thus P SX191 carried sites for Kpnl, PstI, £coRI, HindHI, Clal, Sfihl 

and BamHI . . iL _ 11T . 

The resulting plasmid P SX191a 2 M was digested with BamHI and 

40HindIII, and a purified 2.6 Kb BamHI/HjndIII fragment was cloned in 
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M13mpl8 to generate MlSmplSa^ for mutagenesis by described methods. A 
synthetic oligonucleotide N0R973, with the following sequence: 

5 ' ( TTC ATACTGCT6CAGCT6TGGACAC ) 3 ' 
was used to introduce a Pstl site at position 2102 (SEQ ID N0:1) in the cDNA 
5sequence, and a oligonucleotide (N0R974) with the following sequence: 

5' (AGCCACCCCCGCGGAGTTTACCAC)3' 
was used to introduce a SacII site at position 2271 (SEQ ID N0:1) in the 
cDNA sequence. These sites were chosen because they- did not introduce 
alterations in the encoded amino acid sequence, and they were within a 
lOconvenient distance of the bait region in human a^i cDNA. Both primers were 
used in the same mutagenesis experiment (Kunkel, T. A., Roberts, J.D. and 
Zakour, R.A. Rapid and Efficient Site-Specific Mutagenesis without Phenotypic 
Se1ection ' Methods in Fnrvmol 154, 367-382, 1987); dsDNA was isolated from 
mutated M13mpl8 a2 M plaques, and the DNA was digested with the restriction 
15enzymes Pstl and SacII. Correctly mutated recombinants, which had an insert 
of 160 bp,. were further analyzed by DNA sequencing (Tabor, S. and Richardson, 
C.C. DNA sequence analysis with a modified bacteriophage T7 DNA polymerase. 
Proc. Natl. Acad. Sci . U.S. A 84, 4767-4771, 1987). A 2.6 kb BamHI/Hindlll 
fragment from a correct o^M cDNA mutant (M13mpl8a 2 M#212. 1) was subcloned in 
20 a BamHI/HindHI digested pUC13 vector, and a correct subclone pl308 was 
isolated and characterized with BamHI/HindHI and Pstl/SacII double 
digestions and DNA electrophoresis. 

The Pstl/SacII fragment in pl308 can be excised and replaced 
with a different DNA fragment, which encodes bait region variants. The 
25 resulting new variants (bait region mutants or analogs) of cDNA can be 
isolated as BamHI/Clal fragments and subcloned back into BamHI/Clal digested 
expression vector pll67. 

In the present example DNA encoding the amino acids of the bait 
region for human PZP (Sottrup-Jensen et al . 1989, supra) was obtained from 
30 ligation, annealing and cloning of 8 synthetic oligonucleotide,. 

The DNA sequence of the synthetic fragment and the encoded amino 
acids as inserted into the or^ clone are given in SEQ ID N0:3, and comprises 
positions 2107 to 2305 and the corresponding amino acids. A Pstl site was 
introduced at the 5' end in the synthetic fragment, and sicli and BamHI sites 
35 were introduced at the 3' end. 

This synthetic 0.2 kb DNA fragment was cloned in a Pstl/BamHI 
digested H13mpl8 vector for DNA sequencing. DNA from a clone containing the 
correct sequence was digested with Pstl and SacII, and the purified 0.2 kb 
fragment was cloned in a Pstl/SacII digested and gel purified pl308 vector. 
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A correct recombinant, p267PZP, was characterized with restriction enzyme 
digestions, and from this plasmid, bait region mutated (* 2 M - PZP) cDNA was 
isolated as a 2.7 kb BamHI/Clal fragment and subcloned in a BjmHI/Clal _ 
digested t* 2 M expression vector pll67. The resulting plasmid, designated pl365, 
5was grown as a large scale plasmid preparation, purified by CsCl centrifuga- 
tion, and cotransfected with pDHFR-I into BHK cells. 

Through this procedure the nucleotides 2102 to 2275 in SEQ ID 
N0:1 was removed and replaced with nucleotides 2102 to 2305 in SEQ ID N0:3. 

The procedures for transfection, selection of bait region mutated 
10a 2 M (designated ra^-PZP) recombinants (with an or^ specific ELISA), large 
scale production and purification of mutated were as described elsewhere 
(EXAMPLE 2) in this application. 

r.harartPrization of the nroteina s * inhibitor specificity of a bait region 

15 mutant of human aJM. 

The purified recombinant aj>l mutant, ror 2 M-PZP, was characterized 
with respect to its inhibitor specificity profile against various proteina- 
ses by the use of previously described methods (Sand et al.1985). For 
comparison human plasma derived a 2 M and PZP were treated with the same set 
20 of proteinases in parallel reactions. The proteinases used were chymotryp- 
sin, elastase, trypsin and staphylococcus aureus Glu-specific proteinase. 
It has been reported (Sand et al.1985) that chymotrypsin and elastase show 
a rapid reaction with both PZP and o^M, while the reaction between the two 
proteinase inhibitors and trypsin and Staphylococcus aureus Glu-specific 
25 proteinase is quite dissimilar for PZP and aj\: both proteinases react rapidly 
with a 2 M, while the reaction with PZP is slow (Sand et al.1985). The reason 
for this difference in reaction rate with the different proteinases is 
believed to be due to the fact that the bait region in PZP contains strong 
specificity determinant for chymotrypsin and elastase, but none for trypsin 
30 and Staphylococcus aureus Glu-specific proteinase. 

The results of the analysis is presented in figures 10 to 13. 
Figure 10 illustrates the gel electrophoreses (10 - 20 % reducing 
SDS-PAGE) of the reaction products from chymotrypsin treated human o^, human 
PZP and rajM-PZP. Molecular weight markers (from top to bottom: 180, 120, 92, 
3560, 43, 26, 14 and 6 kD) were applied to lanes 1 and 8. All samples were 
reduced. Lanes 2, 3 and 4 show the cleavage products obtained from reaction 
of chymotrypsin with human plasma derived PZP, ra^-PZP and human plasma 
derived afl, respectively. The ratio of proteinase to inhibitor was 1:1. Lanes 
5, 6 and 7 show cleavage products from similar reactions at a ratio of 2:1 
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between proteinase and the three tested inhibitors. In all 6 lanes cleavage 
products (85 kD) could be identified. This indicated that ra^-PZP reacted 
with chymotrypsin with similar characteristics as did human plasma derived 
Q^M and PZP. — 

5 Figure 11 illustrates the gel electrophoresis (10 - 20 % reducing 

SDS-PAGE) of the reaction products from elastase treated human o^M, human 
PZP and ro^M-PZP. Molecular weight markers were the same as applied on the 
gel in Fig. 2. All samples were reduced. Lanes 2, 3 and 4 show the cleavage 
products obtained from reaction of elastase with human plasma derived PZP, 
lOror^-PZP and human plasma derived o^M, respectively. The ratio of proteinase 
to inhibitor was 1:1. Lanes 5, 6 and 7 show cleavage products from similar 
reactions at a ratio of 2:1 between proteinase and the three tested 
inhibitors. In all 6 lanes cleavage products (85 kD) could be identified. 
This indicated that rot^-PZP reacted with elastase with similar character- 
istics as did human plasma derived ajft and PZP. 

Figure 12 illustrates the gel electrophoresis (10 - 20% reducing 
SDS-PAGE) of the reaction-products from trypsin treated human a^, human PZP 
and ra^-PZP. Molecular weight markers were the same as applied on the gel 
in Fig. 2. All samples were reduced. Lanes 2, 3 and 4 show the cleavage 
20 products obtained from reaction of trypsin with human plasma derived PZP, 
human plasma derived o^ and ra a M-PZP, respectively. The ratio of proteinase 
to inhibitor was 1:1. Lanes 5, 6 and 7 show cleavage products from similar 
reactions at a ratio of 2:1 between proteinase and the three tested 
inhibitors. In lanes 3 and 6 cleavage products (85 kD) could be identified 
25 from the reaction between trypsin and tt2 M. In lanes 2, 4, 5 and 7 no cleavage 
products were observed from the reaction of trypsin with PZP and ra^-PZP. 
This result demonstrated that ra a M-PZP reacted poorly with trypsin as did 
human plasma derived PZP, while o^M was cleaved in the reaction with trypsin. 

Figure 13 illustrates the gel electrophoresis (10 - 20 % reducing 
30 SDS-PAGE) of the reaction products from Staphylococcus aureus Glu-specific 
protease treated human tta M, human PZP and ro^-PZP. Molecular weight markers 
were the same as applied on the gel in Fig. 2. All 'samples were reduced. 
Lanes 2, 3 and 4 show the cleavage products obtained from reaction of 
Staphylococcus aum.s Glu-specific protease with human' plasma derived PZP, 
35r tt2 M-PZP and human plasma derived tt2 M, respectively. The ratio of proteinase 
to inhibitor was 1:1. Lanes 5, 6 and 7 show cleavage products from similar 
reactions at a ratio of 2:1 between proteinase and the three tested 
inhibitors. In lanes 4 and 7 cleavage products (85 kD) could be identified 
from the reaction between Staphylococci a »r 0 ,.c Glu-specific protease and 
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ajfi In lanes 2, 3, 5 and 6 much less cleavage product could be identified 
from the reaction of this proteinase with PZP and ro^-PZP. This result . 
demonstrated that m 2 M-PZP reacted poorly with the Sta phylococcus aureus ^ 
proteinase as did human plasma derived PZP, while or 2 M was cleaved in the 

5 reaction with this proteinase. 

It can be concluded that ra^-PZP showed the same pattern of_ 
reaction with four proteinases as did human plasma derived PZP. This pattern 
of reaction was different from the corresponding pattern obtained from 
reaction with o^M. Thus ra 2 M-PZP has been demonstrated to have a proteinase 
10 inhibitor profile similar to native PZP and dissimilar to ajl. Thus it has 
been demonstrated that the proteinase inhibitor profile of ^ can be 
modulated by substitution of DNA fragments encoding the bait region. 

The substitution as described in this invention did not destroy 
the activity of the proteinase inhibitor, and it is therefore demonstrated 
15 that functional macroglobulin hybrids can be constructed by substitutions 
(mutations) in the bait region. The finding will lead to the design of o^M- 
derivatives with new desired proteinase specificities. No doubt, these 
results could be extended to other macroglobulin based hybrids, in which the 
bait region can be modified at will to obtain new inhibitor specificities. 
20 Aggressive activity of proteinases is often a problem in relation 

to various diseases (e.g. the activity of elastase and cathepsin 6 in severe 
inflammation leads to tissue and organ destruction and failure). Inhibitors 
of such proteinases will be useful in drug design. In situations where the 
target site for the proteinase is known, but no inhibitor can be identified, 
25a 2 M can be engineered (mutated in the bait region) to obtain the desired 
specificity. In a situation where the target specificity of the proteinase 
in question is unknown, saturation mutagenesis or random synthesis of the 
bait region will lead to an indefinite number of target sequences that can 
be introduced and expressed in hybrid macroglobulins. These hybrids can be 
30 screened for proteinase inhibition, and the target sequence(s) can be 
identified. The resulting a 2 M analog can be produced and purified as described 
elsewhere in this invention. Upon injection into the circulation such 
analogs will inhibit and clear from the blood any proteinase of the given 
specificity. 

35 Introduction of protein analogs or mutants in the human body 

always raises the possibility for antigenicity. The generation of- a panel 
of 45 mouse monoclonal antibodies against human a& has been described (Van 
Leuven et al.1988; Delain et al.1988). None of these antibodies were directed 
against the bait region. This indicates that the bait region is not highly 
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antigenic and that mutants in this region of the molecule can be generated 
and used for therapeutical uses without risk for antibody development. .-- 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Novo Nordisk A/S 

(ii) TITLE OF INVENTION: Expression of Plasma Glycoprotei 
(iii) NUMBER OF SEQUENCES: 4 



(B) FILING DATE: 29-AUG-1989 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: N 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(F) TISSUE TYPE: Hepatic 

(G) CELL TYPE: Hepatoblastoma 

(H) CELL LINE: HepG2 

(fx) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 29.. 4450 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GTCTCCTCCA GCTCCTTCTT TCTGCAAC ATG GGG AAG AAC AAA CTC CTT CAT 

Met Gly Lys Asn Lys Leu Leu His 




(B) STREET: Novo Alle 

(C) CITY: Bagsvaerd 

(E) COUNTRY: DENMARK 

(F) ZIP: DK-2880 



ordisk A/S, Patent Department 




R: DK 4235/89, DK 4236/89, DK 4237/89 
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GTC TCT GGA AAA CCG CAG TAT ATG GTT CTG GTC CCC TCC CTG PTP PAP i/io 
Val Ser Gly Lys Pro Gin Tyr Met Val Leu Sai p" Ser leu leu His 148 
" 30 35 40 



ACT GAG ACC ACT GAG AAG GGC TGT GTC CTT CTG AGC TAC CTG AAT pap 
Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Eeu Ser Tyr let Z G^u 
4o 50 



55 



110 US 

net SIT fC< if rf a AC P f TG GTC TTT GTC CAG ACA GAC AAA TCA 
Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin Thr Asp Lys Ser 

125 130 135 



180 



205 2 io 

uil d CT IL° J CC GTG GAG TTT GT T CTT CCC AAG TTT GAA GTA CAA 
His Pro Phe Thr Val Glu Glu Phe Val Leu Pro Lysine Jltv™ GT?i 
220 225 230 

vlf Thr SIf P CA f* 6 ? TA A I C ACC ATC TTG GAA GAA GAG ATG AAT GTA 
Val Thr Val Pro Lys He He Thr He Leu Glu Glu Glu Met Asn Val 

240 245 



196- 



ThJ Slf tE SI? f T ?f T I" TTG GAG TCT GTC AGG GGA M AGG AGC 24* 
Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg Gly Asn Arg Ser 

ou 65 70 



llu Vhl ill A?n CTG GCG Sf G MT GAC GTA CTC CAG TGT GTC GCC 292 
Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu His Cys Val Ala 

/:> 80 85 

HI a CT S TC £ CA AAG TCT TCA TCC MT GAG GAG GTA ATG TTC CTC ACT 3dn 
Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val Set Phe Eeu Thr 
3U 95 loo 

SI? G?? SIf fC? n! P CA ? CC Jf A Sf A TTT MG AAG CGG ACC A ™ GTG 388 
val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys Arg Thr Thr Val 



436 



ATC TAC AAA CCA GGG CAG ACA GTG AAA TTT CGT GTT GTC TCC ATG GAT AM 

He Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val Val Ser nit AsJ 484 
1W 145 150 

GAA AAC TTT CAC CCC CTG AAT GAG TTG ATT CCA CTA GTA TAP ATT pap mo 

Glu Asn Phe His Pro Leu Asn Glu Leu He Pro let SIl tJE He' G?n " 2 
ido 150 



GAT CCC AAA GGA AAT CGC ATC GCA CAA TGG CAG AGT TTC CAG TTA GAG Rftfl 
Asp Pro Lys Gly Asn Arg lie Ala Gin Trp Gin Ser Phe Gin leu tlu 



628 



GGT GGC CTC AAG CAA TTT TCT TTT CCC CTC TCA TCA GAG CCC TTP PAP 
Gly Gly Leu Lys Gin Phe Ser Phe Pro Leu Ser Ser G ?S Pro III 
10& I 90 195 200 

G?J s" Tvr fCs SI? SI? S TA C ? G ^ TCA GGT GGA AGG AGA G AG 676 
my ber Tyr Lys Val Val Val Gin Lys Lys Ser Gly Gly Arg Thr Glu 



724 



772 
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Tr . rTr TrT rrr rTA tap ACA TAT 6GG AAG CCT GTC CCT GGA CAT GTG 
Uf S5 SI Sg 25 Tyr Thr T,r Cly Lys Pro Val Pro Sly His Val 
250 255 2*0 



820 



868 



ACT GTG AGC ATT TGC AGA AAG TAT AGT GAC GCT TCC GAC TGC CAC GGT 
Thr Val Ser He Cys Arg Lys Tyr Ser Asp Ala Ser Asp Cys His Gly 
265 270 275 

GAA GAT TCA CAG GCT TTC TGT GAG AAA TTC AGT GGA CAG CTA AAC AGC 916 
Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly Gin Leu Asn ber 
285 290 

CAT GGC TGC TTC TAT CAG CAA GTA AAA ACC AAG GTC TTC CAG CTG AAG 964 
His Gly Cys Phe Tyr Gin Gin Val Lys Thr Lys Val Phe Gin Leu Lys 
300 305 310 

AGG AAG GAG TAT GAA ATG AAA CTT CAC ACT GAG GCC CAG ATC CAA GAA 1012 
Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala Gin He Gin Glu 
315 

GAA GGA ACA GTG GTG GAA TTG ACT GGA AGG CAG TCC AGT GAA ATC ACA 1060 
Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser Ser Glu He Thr 
330 335 340 

AGA ACC ATA ACC AAA CTC TCA" TTT GTG AAA GTG GAC TCA CAC TTT CGA 1108 
Arg Thr He Thr Lys Leu Ser Phe Val Lys Val Asp Ser His Phe Arg 
345 350 355 

CAG GGA ATT CCC TTC TTT GGG CAG GTG CGC CTA GTA GAT GGG AAA GGC 1156 
Gin Gly He Pro Phe Phe Gly Gin Val Arg Leu Val Asp Gly Lys Gly 
365 370 

GTC CCT ATA CCA AAT AAA GTC ATA TTC ATC AGA GGA AAT GAA GCA AAC' 1204 
Val Pro He Pro Asn Lys Val He Phe He Arg Gly Asn Glu Ala Asn 
380 3 85 3yo 

TAT TAC TCC AAT GCT ACC ACG GAT GAG CAT GGC CTT GTA CAG TTC TCT 1252 
Tyr Tyr ier AsA Ala Thr Thr Asp Glu His Gly Leu Val Gin Phe Ser 
395 400 40b 

ATC AAC ACC ACC AAT GTT ATG GGT ACC TCT CTT ACT GTT AGG GTC AAT 1500 
He Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr Val Arg Val Asn 
410 415 420 

TAC AAG GAT CGT AGT CCC TGT TAC GGC TAC CAG TGG GTG TCA GAA GAA 1348 
Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp Val Ser Glu Glu 
425 430 43b 

CAC GAA GAG GCA CAT CAC ACT GCT TAT CTT GTG TTCTCC CCA AGC AAG 1396 
His Glu Glu Ala His His Thr Ala Tyr Leu VaT Phe~Ser Pro Ser Lys 
445 450 45b 

AGC TTT GTC CAC CTT GAG CCC ATG TCT CAT GAA CTA CCC TGT GGC CAT 1444 
Ser Phe Val His Leu Glu Pro Met Ser His Glu Leu Pro Cys Gly His 
460 465 4/ 
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Jhl nn ?£ A Sf 6 A S AT I AT ATT CTG MT GGA GGC ACG CTG CTG 1492 
Thr Gin Thr Val Gin Ala His Tyr He Leu Asn Gly Gly Thr Leu Leu 

475 480 485 



G?S lit fCc fic f TC J CC II C ™ T TAT CTG ATA ATG GCA ^ G GGA GGC 1540- 
Gly Leu Lys Lys Leu Ser Phe Tyr Tyr Leu lie Met Ala Lys Gly Gly 

< * yu 495 500 

ATT GTC CGA ACT GGG ACT CAT GGA CTG CTT GTG AAG CAG GAA GAu ATG 158ff 
lie Val Arg Thr Gly Thr His Gly Leu Leu Val Lys G?n Glu Asp Met 



675 680 

W fi? ?F « GT F ATG TGT CCA CAG CTT CAA CAG TAT GAA 

Ser Lys He Arg Lys Pro Lys Met Cys Pro Gin Leu Gin Gin Tyr Glu 

685 690 695 



1636 



1732 



515 520 

fC< Si £- T IF l CC A I C TCA ATC CCT GTG ™ G TCA GAC ATT GCT CCT 
Lys Gly His Phe Ser He Ser He Pro Val Lys Ser Asp He Ala Pro 
525 530 535 

vll All 25 FI f TC A I C l AJ G f T GTT TTA CCT ACC GGG GAC GTG ATT 1684 
Val Ala Arg Leu Leu He Tyr Ala Val Leu Pro Thr Gly Asp Val He 

540 545 550 

ffS S A I l»l a CA f M I AT GAT GTT GAA AAT TGT CTG GG C AAC AAG GTG 
Gly Asp Ser Ala Lys Tyr Asp Val Glu Asn Cys Leu Ala Asn Lys Val 
Si) i> 560 565 

Asl Fen W IF ? GC P P CAA AGT CTC CCA GCC TCA CAC G CC CAC 1780 
Asp Leu Ser Phe Ser Pro Ser Gin Ser Leu Pro Ala Ser His Ala His 

5/0 575 580 

fin SI? ?£ A 5f G 5f T CCT CAG TCC GTC TGC GCC CT C CGT GCT GTG 
Leu Arg Val Thr Ala Ala Pro Gin Ser Val Cys Ala Leu Arg Ala Val 

585 590 595 600 

fl A n SfJ AGC S TG CTG f TC ATG AAG CCT GAT GCT GAG CTC TCG GCG TCC 
Asp Gin Ser Val Leu Leu Met Lys Pro Asp Ala Glu Leu Ser Ala Ser 
6 °5 610 615 

llr 5IT JCr iT CTG ? TA P G f A MG GAC CTC ACT GGC HC CCT GGG 
Ser Val Tyr A .i Leu Leu Pro Glu Lys Asp Leu Thr Gly Phe Pro Gly 

620 625 630 

Prl [IS XI fl A n rf G f C 5 AT S A GAC TGC ATC MJ CGT CAT AAT GTC 1972 
Pro Leu Asn Asp Gin Asp Asp Glu Asp Cys He Asn Arg His Asn Val 

b35 640 645 

Tvl A H KI n C ?P I AT 'ff T CCA GTA TCA AGT ACA MT GAA ^ 
Tyr lie Asn Gly He Thr Tyr Thr Pro Val Ser Ser Thr Asn Glu Lys 

650 655 66Q 

Isl Sit w dI C CTA S? G 5 AC ATG GGC TTA GCA TTC ACC AAC 2068 
Asp Met Tyr Ser Phe Leu Glu Asp Met ^ly Leu Lys- Al a -Phe- Thr- Asn 



1828 



1876 



1924 



2020 



2116 
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ATG CAT GGA CCT GAA GGT CTA CGT GTA GGT TTT TAT GAG TCA GAT GTA 2164 
Met His Gly Pro 61 u Gly Leu Arg Val Gly Phe Tyr Glu Ser Asp Val 
700 705 710 

ATG GGA AGA GGC CAT GCA CGC CTG GTG CAT GTT GAA GAG CCT CAC ACG 2212 
Met Gly Arg Gly His Ala Arg Leu Val His Val Glu Glu Pro His Thr 
715 720 725 

GAG ACC GTA CGA AAG TAC TTC CCT GAG ACA TGG ATC TGG GAT TTG GTG 2260 
Glu Thr Val Arg Lys Tyr Phe Pro Glu Thr Trp He Trp Asp Leu Val 
730 735 740 

GTG GTA AAC TCA GCA GGT GTG GCT GAG GTA GGA GTA ACA GTC CCT GAC 2308 
Val Val Asn Ser Ala Gly Val Ala Glu Val Gly Val Thr Val Pro Asp 
745 750 755 760 

ACC ATC ACC GAG TGG AAG GCA GGG GCC TTC TGC CTG TCT GAA GAT GCT 2356 
Thr He Thr Glu Trp Lys Ala Gly Ala Phe Cys Leu Ser Glu Asp Ala 
765 770 775 

GGA CTT GGT ATC TCT TCC ACT GCC TCT CTC CGA GCC TTC CAG CCC TTC 2404 
Gly Leu Gly He Ser Ser Thr Ala Ser Leu Arg Ala Phe Gin Pro Phe 
780 785 790 

TTT GTG GAG CTT ACA ATG CCT TAC TCT GTG ATT CGT GGA GAG GCC TTC 2452 
Phe Val Glu Leu Thr Met Pro Tyr Ser Val He Arg Gly Glu Ala Phe 
795 800 805 

ACA CTC AAG GCC ACG GTC CTA AAC TAC CTT CCC AAA TGC ATC CGG GTC 2500 
Thr Leu Lys Ala Thr Val Leu Asn Tyr Leu Pro Lys Cys He Arg Val 
810 815 820 

AGT GTG CAG CTG GAA GCC TCT CCC GCC TTC CTA GCT GTC CCA GTG GAG 2548 
Ser Val Gin Leu Glu Ala Ser Pro Ala Phe Leu Ala Val Pro Val Glu 
825 830 835 840 

AAG GAA CAA GCG CCT CAC TGC ATC TGT GCA AAC GGG CGG CAA ACT GTG 2596 - 

Lys Glu Gin Ala Pro His Cys He Cys Ala Asn Gly Arg Gin Thr Val 
845 850 855 

TCC TGG GCA GTA ACC CCA AAG TCA TTA GGA AAT GTG AAT TTC ACT GTG 2644 
Ser Trp Ala Val Thr Pro Lys Ser Leu Gly Asn Val Asn Phe Thr Val 
860 865 870 

AGC GCA GAG GCA CTA GAG TCT CAA GAG CTG TGT GGG ACT GAG GTG CCT 2692 
Ser Ala Glu Ala Leu Glu Ser Gin Glu Leu Cys Gly Thr Glu Val Pro 
875 880 885 - 

TCA GTT CCT GAA CAC GGA AGG AAA GAC ACA GTC ATC AAG CCT CTG TTG 2740 
Ser Val Pro Glu His Gly Arg Lys Asp Thr Variletys Pro* Leu- Leu 
890 895 900 

GTT GAA CCT GAA GGA CTA GAG AAG GAA ACA ACA TTC AAC TCC CTA CTT 2788 
Val Glu Pro Glu Gly Leu Glu Lys Glu Thr Thr Phe Asn Ser Leu Leu 
905 910 915 920 
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TGT CCA TCA 6GT GGT GAG GTT TCT GAA GAA TTA TCC CTG AAA CTG CCA 
Cys Pro Ser Gly Gly Glu Val Ser Glu Glu leu llr ilu fyt teu Pro 
925 930 935 



CCA AAT GTG GTA GAA GAA TCT GCC CGA GCT TCT GTC TCA GTT TTr rra 
Pro Asn Val Val Glu Glu Ser Ala Arg Ala III vl? £ vIT lTu Gly 

945 950 

GAC ATA TTA GGC TCT GCC ATG CAA AAC ACA CAA AAT CTT CTC CAG ATP 
Asp lie Leu Gly Ser Ala Met Gin Asn Thr G™ Tsn ill ilu G?n 



1120 1125 

Gift 55 S C T GG f* 6 5£* GCA CAA GAA GGG GAC CAT GGC AGC CAT GTA 
Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His Gly Ser His Sal 

lliU H35 H40 



2836 



2884- 



2932- 



955 960 9 6 5 

T AT r GC I GT GGA GAG CAG AAT ATG GTC CTC TTT GCT CCT AAC ATC ?Qftn 
Pro Tyr Gly Cys Gly Glu Gin Asn Met Val Leu Phe Ala Pro A^n ?le 2980 
3/u 975 980 

tU SI? CTG ? AT I AT CTA MT GAA ACA CAG CAG CTT ACT CCA GAG ATC 30?R 
Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Gin Leu Thr Pro Glu lie 

990 995 iooo 

^ ler f!s Ala nl r? C J AT ? TC ^ ACT GGT TAC CAG AGA GAG TTG 
Lys Ser Lys Ala lie Gly Tyr Leu Asn Thr Gly Tyr Gin Arg Gin Leu 

10 °5 1010 1015 

A?S lyr fvt Sis t2 AsI r GC I" T AC f C ACC TTT GGG GAG CGA TAT 
iyr Lys His Tyr Asp Gly Ser Tyr Ser Thr Phe Gly Glu Arg Tyr 

1020 1025 1030 

G G 5 Art Asn tin SS ?£ C t GG CTC ACA GCC TTT GTT CTG AAG ACT 
biy Arg Asn Gin Gly Asn Thr Trp Leu Thr Ala Phe Val Leu Lys Thr 

1040 1045 

III GCC , GAA GCT CGA GCC TAC ATC TTC ATC GAT GAA GCA CAC ATT ACC 3220 
Phe Ala Gin Ala Arg Ala Tyr He Phe He Asp Glu Ala His fie Tnr 22 ° 
1U0U 1055 1060 

GlS Ala flu ?i? t? CTC I CC £? G AGG GAG AAG GAC AAT GGC TGT TTC 
1065 TrP tn™ Ser Gln Arg G1n L * s As P Asn Gly Cys Phe 

10/0 1075 1080 



3076 



3124 



3172 



3268 



3316 



Xrg Ser Sel G?S S f TG ^ ? AT GCC ATA MG GGA GGA GTA GAA 
Arg ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys Gly Gly Val Glu 

1° 85 1090 1095 

Isl G?J SI? ?k C CK I CC GCC TAT ATC ACC ATC GCC CTT CTG GAG ATT 3364 
Asp Glu Val Thr Leu Ser Ala Tyr He Thr He Ala Leu Leu Glu lie 

11UU 1105 mo 

£ ffi K £ « R "T gC CGC MT KC^CTG TTT TGC CTG 3„ 2 

* * 1 3 11 ?ft i i or 



3460 
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TC6 GAG GAC CTG ACC TCT GCA ALL aal aii, uiu ™» 
Ser Glu Asp Leu Thr Ser Ala Thr Asn He Val^Lys Trp He Thr LyS Q 

CAG CAG AAT GCC CAG FCC GGI ML ill .11 ACC CAG CAC ACA GTC GTG 
Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin His Thr Val Val 
1245 1250 1 

GCT CTC CAT GCT CTG TCC AAA TAT GGA GCA GCC ACA TTT ACC AGG ACT 
Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr Phe Thr Arg Thr 

1265 l^ /u 



1260 



3508 



3556 



3604 



TAT ACC AAA GCA CTG CTG GCC TAT GCT TTT GCC CTG GCA GGT AAC CAG 
lyl Thr L^ Ala Uu S« Ala Tyr Ala Phe Ala Leu Ala Gly Asn 61^ 
U45 1150 

GAC AAG AGG AAG GAA GTA CTC AAG TCA CTT AAT GAG GAA GCT GTG AAG 
Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu Glu Ala Val Lys 
1165 11'° 11 

AAA GAC AAC TCT GTC CAT TGG GAG CGC CCT CAG AAA CCC AAG GCA CCA 
Asp Z Ser Sal His Trp Glu Arg Pro Gin Lys Pro Lys Ala Pro 
1180 1185 

GTG GGG CAT TTT TAC GAA CCC CAG GCT CCC TCT GCT GAG GTG GAG ATG 3652 
Val Gly His Phe Tyr Glu Pro Gin Ala Pro Ser Ala Glu Val Glu Met 
H95 1200 1205 

ACA TCC TAT GTG CTC CTC GCT TAT CTC ACG GCC CAG CCA GCC CCA ACC 3700 
Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin Pro Ala Pro Thr 
1210 1215 I 220 

TCG GAG GAC CTG ACC TCT GCA ACC AAC ATC GTG AAG TGG ATC ACG AAG 3748 
Ser Glu Asp Leu Thr Ser / 
1225 1230 

CAG CAG AAT GCC CAG GGC GGT TTC TCC TCC ACC CAG CAC ACA GTG GTG 3796 
Gin C 
1245 



3844 



GGG AAG GCT GCA CAG GTG ACT ATC CAG TCT TCA GGG ACA TTT TCC AGC 3892 
Gly Lys Ala Ala Gin Val Thr He Gin Ser Ser Gly Thr Phe Ser Ser 
1275 1280 1 2 «5 



AAA TTC CAA GTG GAC AAC AAC AAC CGC CTG TTA CTG CAG CAG GTC TCA 3940 - 

Lys Phe Gin Val Asp Asn Asn Asn Arg Leu Leu Leu Gin Gin Val Ser 
1290 IWS 1300 

TTG CCA GAG CTG CCT GGG GAA TAC AGC ATG AAA GTG ACA GGA GAA GGA 3988 
Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met Lys Val Thr Gly Glu Gly 
1305 1310 1315 ^ u 

TGT GTC TAC CTC CAG ACA TCC TTG AAA TAC AAT ATT CTC CCA GAA AAG 4036 
Cys Val Tyr Leu Gin Thr Ser Leu Lys Tyr Asn He Leu Pro Glu Lys 
1325 1330 - 1JJ3 

GAA GAG TTC CCC TTT GCT TTA GGA GTG CAG ACT CTG CCT CAA ACT TGT 4084 
Glu G1U Phe Pro Phe Ala Leu Gly Val Gin Thf-Leu-Pro thr ^Thr- Cys 
1340 1345 I 350 

GAT GAA CCC AAA GCC CAC ACC AGC TTC CAA ATC TCC CTA AGT GTC AGT . 4132 
Asp Glu Pro Lys Ala His Thr Ser Phe Gin He Ser Leu Ser Val Ser 
1355 1360 1365 
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TAC ACA GGG AGC CGC TCT GCC TCC AAC ATG GCG ATC GTT GAT GTG AAG 4180 
Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met Ala He Val Asp Val Lys 
1370 1375 1380 

£t ^If J CT 11° rV CCC CTG MG CCA ACA 6TG *M ATG CTT GAA 4228- 
Het /al Ser Gly Phe lie Pro Leu Lys Pro Thr Val Lys Met Leu Glu 

1385 1390 1395 HOO 

AGA TCT AAC CAT GTG AGC CGG ACA GAA GTC AGC AGC AAC CAT GTC TTG 4276- 
Arg Ser Asn His Val Ser Arg Thr Glu Val Ser Ser Asn hYs Val Leu 
1405 Hio 1425 

ATT TAC CTT GAT AAG GTG TCA AAT CAG ACA CTG AGC TTG TTC TTC ACG 4324 
He Tyr Leu Asp Lys Val Ser Asn Gin Thr Leu Ser Leu Phe Phe Thr 
1420 1425 1430 

SIT fI G GAA GAT J T £ CCA GTA AGA GAT CTC AAA CCA GCC ATA GTG AAA 4372 
Val Leu Gin Asp Val Pro Val Arg Asp Leu Lys Pro Ala He Val Lys 

1440 1445 

SIf It 1 5 AT | AC J AC GAG ACG GAT GAG TTT GCA ATT GCT GAG TAC AAT 4420 
l^n ASP Tyr Tyr 61 U Thr As P Glu Phe Ala He Ala Glu Tyr Asn 
1450 1455 1460 

All Prl If w AM a AT CJT Sf A MJ GCT TGA AGACCAC AAGGCTGAAA 4470 
Ala Pro Cys Ser Lys Asp Leu Gly Asn Ala 
1465 1470 

AGTGCTTTGC TGGAGTCCTG TTCTCTGAGC TCCACAGAAG ACACGTGTTT TTGTATCTTT 4530 
AAAGACTTGA TGAATAAACA CTTTTTCTGG TCAAAAAAA 4569 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1474 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(E) FEATURES: bait region: 690-730 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Lys Asn Lys Leu Leu His Pro Ser Leu Val Leu Leu Leu Leu 
1 5 io 15 

Val Leu Leu Pro Thr Asp Ala Ser Val Ser Gly Lys Pro Gin Tyr Met 
20 25 30 

Val Leu Val Pro Ser Leu Leu His Thr Glu Thr Thr~Glu Lys Gly Cys 
35 40 45 

Val Leu Leu Ser Tyr Leu Asn Glu Thr Val Thr Val Ser Ala Ser Leu 
50 55 60 

Glu Ser Val Arg Gly Asn Arg Ser Leu Phe Thr Asp Leu Glu Ala Glu 
65 70 75 80 
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Asn Asp Val Leu His Cys Val Ala Phe Ala Val Pro Lys Ser Ser Ser 
85 90 

Asn Glu Glu Val Met Phe Leu Thr Val Gin Val Lys Gly Pro Thr Gin 
100 105 HO 

Glu Phe Lys Lys Arg Thr Thr Val Met Val Lys Asn Glu Asp Ser Leu 
115 120 125 

Val Phe Val Gin Thr Asp Lys Ser He Tyr Lys Pro Gly Gin Thr Val 
130 135 140 

Lys Phe Arg Val Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu 
145 " 150 155 loO 

Leu He Pro Leu Val Tyr He Gin Asp Pro Lys Gly Asn Arg lie Ala 
165 170 175 

Gin Trp Gin Ser Phe Gin Leu Glu Gly Gly Leu Lys Gin Phe Ser Phe 
180 185 190 

Pro Leu Ser Ser Glu Pro Phe Gin Gly Ser Tyr Lys Val Val Val Gin 
195 200 205 

Lys Lys Ser Gly Gly Arg Thr Glu His Pro Phe Thr Val Glu Glu Phe 
210 215 220 

Val Leu Pro Lys Phe Glu Val Gin Val Thr Val Pro Lys He He Thr 
225 230 235 240 

He Leu Glu Glu Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr 
245 250 255 

Gly Lys Pro Val Pro Gly His Val Thr Val Ser He Cys Arg Lys Tyr 
260 265 270 

Ser Asp Ala Ser Asp Cys His Gly Glu Asp Ser Gin Ala Phe Cys Glu 
275 280 285 

Lys Phe Ser Gly Gin Leu Asn Ser His Gly Cys Phe Tyr Gin Gin Val 
290 295 300 

Lys Thr Lys Val Phe Gin Leu Lys Arg Lys Glu Tyr Glu Met Lys Leu 
305 310 315 320 

His Thr Glu Ala Gin He Gin Glu Glu Gly Thr Val Val Glu Leu Thr 
325 330 335 

Gly Arg Gin Ser Ser Glu He Thr Arg Thr He" Thr~Lys teu -Ser Phe 
340 345 350 

Val Lys Val Asp Ser His Phe Arg Gin Gly He Pro Phe Phe Gly Gin 
355 360 365 

Val Arg Leu Val Asp Gly Lys Gly Val Pro He Pro Asn Lys Val He 
370 375 380 
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Phe He Arg Gly Asn Glu Ala Asn Tyr Tyr Ser Asn Ala Thr thr Asp 

390 395 400 

Glu His Gly Leu Val Gin Phe Ser He Asn Thr Thr Asn Val Met Gly 
405 410 415 

Thr Ser Leu Thr. Val Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr 

425 430 

Gly Tyr Gin Trp Val Ser Glu Glu His Glu Glu Ala His His Thr Ala 

440 445 

Tyr Leu Val Phe Ser Pro Ser Lys Ser Phe Val His Leu Glu Pro Met 
H3U 455 460 

Ser His Glu Leu Pro Cys Gly His Thr Gin Thr Val Gin Ala His Tyr 

4/0 475 480 

He Leu Asn Gly Gly Thr Leu Leu Gly Leu Lys Lys Leu Ser Phe Tyr 
485 490 495 3 

Tyr Leu He Met Ala Lys Gly Gly He Val Arg Thr Gly Thr His Gly 
3UU 505 510 

Leu Leu Val Lys Gin Glu Asp Met Lys Gly His Phe Ser He Ser He 
515 520 525 

Pro Val Lys Ser Asp He Ala Pro Val Ala Arg Leu Leu He Tyr Ala 

o<3d 540 

Val Leu Pro Thr Gly Asp Val He Gly Asp Ser Ala Lys Tyr Asp Val 

bb0 555 560 

Glu Asn Cys Leu Ala Asn Lys Val Asp Leu Ser Phe Ser Pro Ser Gin 
bob 570 



575 



Ser Leu Pro Ala Ser His Ala His Leu Arg Val Thr Ala Ala Pro Gin 
580 585 590 

Ser Val Cys Ala Leu Arg Ala Val Asp Gin Ser Val Leu Leu Met Lys 
595 600 605 

Pro Asp Ala Glu Leu Ser Ala Ser Ser Val Tyr Asn Leu Leu Pro Glu 
01U 615 620 

Lys Asp Leu Thr Gly Phe Pro Gly Pro Leu Asn Asp Gin Asp Asp Glu 

630 635 . 640 

Asp Cys He Asn Arg His Asn Val Tyr He Asn Gly He Thr Tyr Thr 
645 650 - -...655. 

Pro Val Ser Ser Thr Asn Glu Lys Asp Met Tyr Ser Phe Leu, Glu Asp 
660 665 670 

Met Gly Leu Lys Ala Phe Thr Asn Ser Lys He Arg Lys Pro Lys Met 
D/:> 680 685 



ISOOCID: <WO_91035S7A1_l_> 



REPLACEMENTShEEr 



WO9./03S57 FCT/DK90/0022S 

44 

Cys Pro Gin Leu Gin Gin Tyr Glu Met His Gly Pro Glu Gly Leu Arg 
690 695 700 

Val Gly Phe Tyr Glu Ser Asp Val Met Gly Arg Gly His Ala Arg Leu 
705 710 715 in 

Val His Val Glu Glu Pro His Thr Glu Thr Val Arg Lys Tyr Phe Pro 
725 730 liX> 

Glu Thr Trp He Trp Asp Leu Val Val Val Asn Ser Ala Gly Val Ala 
740 745 750 

Glu Val Gly Val Thr Val Pro Asp Thr He Thr Glu Trp Lys Ala Gly 
755 760 765 

Ala Phe Cys Leu Ser Glu Asp Ala Gly Leu Gly lie Ser Ser Thr Ala 
770 775 780 

Ser Leu Arg Ala Phe Gin Pro Phe Phe Val Glu Leu Thr Met Pro Tyr 

Ser Val He Arg Gly Glu Ala Phe Thr Leu Lys Ala Thr Val Leu Asn 
805 810 01 

Tyr Leu Pro Lys Cys He Arg- Val Ser Val Gin Leu Glu Ala Ser Pro 
820 8 25 830 

Ala Phe Leu Ala Val Pro Val Glu Lys Glu Gin Ala Pro His Cys He 
835 840 8 * 5 

. Cys Ala Asn Gly Arg Gin Thr Val Ser Trp Ala Val Thr Pro Lys Ser 
850 8 55 860 

Leu Gly Asn Val Asn Phe Thr Val Ser Ala Glu Ala Leu Glu Ser Gin 
865 8 70 8 75 

Glu Leu Cys Gly Thr Glu Val Pro Ser Val Pro Glu His Gly Arg Lys 
885 890 895 

Asp Thr Val He Lys Pro Leu Leu Val Glu Pro Glu Gly Leu Glu Lys 
900 905 910 

Glu Thr Thr Phe Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu Val Ser 
915 920 925 

Glu Glu Leu Ser Leu Lys Leu Pro Pro Asn Val Val Glu Glu Ser Ala 
930 935 940 

Arg Ala Ser Val Ser Val Leu Gly Asp He Leu Gly Ser Ala Met Gin 
94 | 950 ' 953- 960 

Asn Thr Gin Asn Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn 
965 970 975 

Met Val Leu Phe Ala Pro Asn He Tyr Val Leu Asp Tyr Leu Asn Glu 
980 985 990 
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Thr Gin Gin Leu Thr Pro Glu Il^Lys Ser Lys Ala He Gly ~Tyr Leu 
« 3 1000 2005 

Asn ThrGly Tyr Gin Arg Gl^Leu Asn Tyr Lys Hi^Tyr Asp Gly Ser 

Tyr Ser Thr Phe Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp 

1030 1035 1040 



Leu Thr Ala Phe Val Leu Lys Thr Phe Ala Gin Ala Arg Ala Tyr II 
1045 1050 3 1055 



Phe lie Asp Glu Ala His He Thr Gin Ala Leu He Trp Leu Ser Gin 
iUb0 1065 1070 

Arg Gin Lys Asp Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn 
1U/3 1080 1085 

Asn Ala He Lys Gly Gly Val Glu Asp Glu Val Thr Leu Ser Ala Tyr 
* u 1095 iioo 

Ijejhr He Ala Leu Leu Glu He Pro Leu Thr Val Thr His Pro Val 

1110 1H5 H20 

Val Arg Asn Ala Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin 
H25 mo H35 

Glu Gly Asp His Gly Ser His Val Tyr Thr Lys Ala Leu Leu Ala Tyr 
11 * u H45 1150 

Ala Phe Ala Leu Ala Gly Asn Gin Asp Lys Arg Lys Glu Val Leu Lys 
1133 H60 lies 

Ser Leu Asn Glu Glu Ala Val Lys Lys Asp Asn Ser Val His Trp Glu 

1175 H80 

Arg_Pro Gin Lys Pro Lys Ala Pro Val Gly His Phe Tyr Glu Pro Gin 

liyu H95 1200 

Ala Pro Ser Ala Glu Val Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr 
1205 1210 1215 

Leu Thr Ala Gin Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr 
I" 0 1225 1230 

Asn He Val Lys Trp He Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe 
1 " 3 1240 1245 . 

Ser SerThr Gin His Thr Val Val Ala Leu His Ala Leu Ser Lys Tyr 

1255 - — 1260 ~- - - 

Gly Ala Ala Thr Phe Thr Arg Thr Gly Lys Ala Ala Gin Val Thr He 

1270 1275 1280 

Gin Ser Ser Gly Thr Phe Ser Ser Lys Phe Gin Val Asp Asn Asn Asn 
1285 1290 1295 
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Arg Leu Leu Leu Gin Gin Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr 
1300 1305 

Ser Met Lys Val Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu 
1315 1320 

Lys Tyr Asn He Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly 
1330 1335 i34U 

Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser 
1345 1350 1355 . 

Phe Gin He Ser Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser 
1365 1370 13/3 

Asn Met Ala He Val Asp Val Lys Met Val Ser Gly Phe lie Pro Leu 
1380 1385 1390 

Lys Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr 
y 1395 H00 H05 

Glu Val Ser Ser Asn His Val Leu He Tyr Leu Asp Lys Val Ser Asn 
1410 1415 1420 

Gin Thr Leu Ser Leu Phe Phe' Thr Val Leu Gin Asp Val Pro Val Arg 
1425 1430 1435 iw 

Asp Leu Lys Pro Ala He Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp 
1445 1450 

. Glu Phe Ala He Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly 
1460 1465 

Asn Ala 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: Y 

(iv) ANTI -SENSE: N _ . „_ .„ 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 29.. 4480 
(D) OTHER INFORMATION: 
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(ix) FEATURE: 

(A) NAME/KEY: insertion seq 

(B) LOCATION: 2102.. 2305 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

6TCTCCTCCA GCTCCTTCTT TCTGCAAC ATG GGG AAG AAC AAA CTC CTT CAT 

Met Gly Lys Asn Lys Leu Leu His 
1 5 

p C ro stl fin SIT P f TC CTC TTG GTC CTC CTG "C ACA GAC GCC TCA 100 
Pro Ser Leu Val Leu Leu Leu Leu Val Leu Leu Pro Thr Asp Ala Ser 

1U 15 20 

SI? I CT rf A CCG CAG TAT ATG GTT CTG GTC CCC TCC CTG CTC CAC 14ft 
Val Ser Gly Lys Pro Gin Tyr Met Val Leu Val Pro Ser Leu Leu His 

30 35 40 

fhJ itS Thr tH f* 6 Sf C I GT GTC CTT CTG AGC TAC CTG AAT GAG 196 
Inr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser Tyr Leu Asn Glu 

45 50 55 



thr SI? SI SI? f T ? CT I CC I TG GAG TCT GTC AGG gga AAC AGG AGC 
Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg Gly Asn Arg Ser 
ou 65 ~- 



70 



52 



244 



292 



PIP. ll C f AC CTG GAG GCG GAG MT GAC GTA CTC CAC TGT GTC GCC 
Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu His Cys Val Ala 
I* 80 85 

Phe All SI? d CA f* 6 I CT TCA TCC AAT GAG GAG G ™ ATG TTC CTC ACT 340 
Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val Met Phe Leu Thr 



388 



436 



SIl ni- SI? f** r? A £ CA ACC CAA GAA m AAG CGG ACC ACA GTG 
Val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys Arg Thr Thr Val 

110 115 120 

51? S T I f* 6 AAC GAG GAC AGT CTG GTC TTT GTC CAG ACA GAC AAA TCA 
Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin Thr Asp Lys Ser 
125 130 135 

ATC TAC AAA CCA GGG CAG ACA GTG AAA TTT CGT GTT GTC TCC ATG GAT dft4 
He Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val Val Ser a£ 
140 145 i5o 

r??. i* C IF . C , AC CCC CTG AAT GAG TTG ATT CCA CTA GTA TAC ATT CAG w 
Glu Asn Phe His Pro Leu Asn Glu Leu lie Prp Lett. Val J£ JU GlS * 
135 160 165 

GAT CCC AAA GGA AAT CGC ATC GCA CAA TGG CAG AGT TTC CAG TTA GAG c ftn 
Asp Pro Lys Gly Asn Arg lie Ala Gin Trp Gin Ser Phe Gin ™ Glu ®° 
1/u 175 180 
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rrT err PTC AAG CAA TTT TCT TTT CCC CTC TCA TCA GAG CCC TTC CAG 
SS S5 2 £55 «2 Pta Ser Phe Pro Leu Ser Ser Glu Pro Phe Gin 
185 190 

85 S S5 « 51? SS SS SK S Si B ffi » 5 g SS 

205 210 

CAC CCT TTC ACC GTG GAG GAA TTT GTT CTT CCC AAG TTT GAA GTA CAA 
ms Pro Phe Yhr Val Glu Glu Phe Val Leu Pro Lys Phe Glu Val Gin 
220 225 

SS S SS SS S K S! c e SS S! c , E SS 85 K SS S2 SIS 

235 240 

TCA GTG TGT GGC CTA TAC ACA TAT GGG AAG CCT GTC CCT GGA CAT GTG 
Ser Val Cys Gly Leu Tyr Thr Tyr Gly Lys Pro Val Pro Gly His Val 
250 2 55 * bU 

ACT GTG AGC ATT TGC AGA AAG TAT AGT GAC GCT TCC GAC TGC CAC GGT 
ill Val Ser lie Cys Arg Lys Tyr Ser Asp Ala Ser Asp Cys His Gly 
265 2 ?0 275 ^ 

GAA GAT TCA CAG GCT TTC TGTGAG AAA TTC AGT GGA CAG CTA AAC AGC 
Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly Gin Leu Asn ber 
285 290 ^ 

r/iT rrr Trr TTC TAT CAG CAA GTA AAA ACC AAG GTC TTC CAG CTG AAG 
ml G?$ Cys III ™ G^n 6t5 Val Lys Thr Lys Val Phe Gin Leu Lys 
300 305 310 

AGG AAG GAG TAT GAA ATG AAA CTT CAC ACT GAG GCC CAG ATC CAA GAA 
Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala Gin He Gin Glu 
315 320 325 

GAA GGA ACA GTG GTG GAA TTG ACT GGA AGG CAG TCC AGT GAA ATC ACA 
Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser Ser Glu lie mr 
330 335 340 

AGA ACC ATA ACC AAA CTC TCA TTT GTG AAA GTG GAC TCA CAC TTT CGA 
Arg Thr He Thr Lys Leu Ser Phe Val Lys Val Asp Ser His Phe Arg 
345 350 355 *™ 

rnr rrA ATT CCC TTC TTT GGG CAG GTG CGC CTA GTA GAT GGG AAA GGC 
lln GlJ III Pro Se III Gly Gin Val Arg Leu Val Asp Gly Lys Gly 
365 370 - S' 0 

nr CCl ATA CCA AAT AAA GTC ATA TTC ATC AGA GGA AAT GAA GCA AAC 
vl? Pro lie PrS tan L?s Val He Phe He Arg- Gly^sn Glu Ala Asn 
,380 385 390 

TAT TAC TCC AAT GCT ACC ACG GAT GAG CAT GGC CTT GTA CAG TTC TCT 1^2 
Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu Val Gin Phe ber 
395 400 405 



628 



676 . 



724 



772 



820 



868 



916 



964 



1012 



1060 



1108 



1156 



1204 
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ftl Hn ? GC 11° J 60 £ CA TCA CAA AGT CTC CCA TCA CAC GCC CAC 
Asp Leu Ser Phe Ser Pro Ser Gin Ser Leu Pro Ala Ser His Ala His 
0/u 575 580 



sS SIT w f TG f TA f> CA GAA ^ G GA C CTC ACT GGC TTC CCT GGG 
Ser Val Tyr Asn Leu Leu Pro Glu Lys Asp Leu Thr Gly Phe Pro Gly 
620 625 630 



1300 
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ATC AAC ACC ACC AAT GTT ATG GGT ACC TCT CTT ACT GTT AGG GTC AAT 
He Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr Val Arg Val Asn 

415 420 

TAC AAG GAT CGT AGT CCC TGT TAC GGC TAC CAG TGG GTG TCA GAA GAA 134ft- 
Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp 5al sir G?2 ^ 
qdb 430 435 440 

CAC GAA GAG GCA CAT CAC ACT GCT TAT CTT GTG TTC TCC CCA AGC AAG 
His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe Ser Pro Ser Lys 
445 450 455 

sfr HI SI? S- C CTT G f G GCC ATG TCT CAT GAA CTA CCC TGT GGC CAT 
Ser Phe Val His Leu Glu Pro Met Ser His Glu Leu Pro Cys Gly His 

460 465 470 

Thl £ A n ?£ A S T f £? G G ? A CAT TAT ATT CTG GGA GGC ACC CTG CTG 
Thr Gin Thr Val Gin Ala His Tyr lie Leu Asn Gly Gly Thr Leu Leu 

4/5> 480 485 

GGG CTG AAG AAG CTC TCC TTC TAT TAT CTG ATA ATG GCA AAG GGA GGC 
Gly Leu Lys Lys Leu Ser Phe Tyr Tyr Leu lie Met Ala Lys Gly Gly 
qyu 495 500 

III vl? ArS THr ?S ?£ T £ AT ?f A CTG CTT GTG MG CAG GAA G AC ATG 
lie Val Arg Thr Gly Thr H 1S Gly Leu Leu Val Lys Gin Glu Asp Met 

510 515 520 

Us Gfv Sfl III l CC A T C I CA A I C CCT GTG MG TCA GAC ATT 6CT CCT 
Lys Gly His Phe Ser He Ser He Pro Val Lys Ser Asp lie Ala Pro 

525 530 535 

Si 51 £ GG T G CK A T C TAT GCT GTT TTA CCT A CC GGG GAC GTG ATT 
Val Ala Arg Leu Leu He Tyr Ala Val Leu Pro Thr Gly Asp Val lie 

540 545 550 

GGG GAT TCT GCA AAA TAT GAT GTT GAA AAT TGT CTG GCC AAC AAG GTG 
Gly Asp Ser Ala Lys Tyr Asp Val Glu Asn Cys Leu Ala JXn Lys Val 
555 560 565 



1396- 



1444 



1492 



1540 



1588 



1636 



1684 



1732 



1780 



CTG CGA GTC ACA GCG GCT CCT CAG TCC GTC TGC GCC CTC CGT GCT GTG 187ft 
Leu Arg Val Thr Ala Ala Pro Gin Ser Val Cys Ala Leu Arg Ala Val 
biSb 590 595 . 600 

As 0 , nJ 55 S T ? CTG f TC ATG A* 6 CCT GAT GCT GAG CTC TCG GCG TCC 
Asp Gin Ser Val Leu Leu Met Lys Pro Asp Ala Glu- Leu -Ser- Al* Ser 

605 610 615 



1876 



1924 
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(TT TTG AAT GAC CAG GAC GAT GAA GAC TGC ATC AAT CGT CAT AAT GTC 1972 
Pro Vet ten Asp ITn Asp Asp Glu Asp Cys lie Asn Arg His Asn Val 
635 640 645 

TAT ATT AAT GGA ATC ACA TAT ACT CCA GTA TCA AGT ACA AAT GAA AAG 2020 - 

]yl III ten GlJ He Th? Tyr Thr Pro Val Ser Ser Thr Asn Glu Lys 
y 650 655 660 

GAT ATG TAC AGC TTC CTA GAG GAC ATG GGC TTA AAG GCA TTC ACC AAC 2068 
Asp Met Tyr Ser Phe Leu Glu Asp Met Gly Leu Lys Ala Phe Thr Asn 

570 675 DOU 



665 



TCA AAG ATT CGT AAA CCC AAA ATG TGT CCA CAG CTG CAG TCA GTG TCA 
Ser Lys He Arg Lys Pro Lys Met Cys Pro Gin Leu Gin Ser Val ber 
685 690 °« 

GCC GGC GCC GTG GGA CAG GGA TAT TAT GGA GCC GGA CTG GGA GTG GTG 
Ala Gly Ala Val Gly Gin Gly Tyr Tyr Gly Ala Gly Leu Gly Val Val 

705 710 



2116 



2164 



GAG AGG CCT TAT GTG CCT CAG CTG GGT ACC TAT AAT GTG ATC CCT CTG 2212 
Glu Arg Pro Tyr Val Pro Gin Leu Gly Thr Tyr Asn Val He Pro Leu 
715 720 '25 

AAT AAT GAG CAG AGC TCA GGA' CCT GTG CCT GAG ACA GTG AGG AAG TAT 2260 
Asn Asn Glu Gin Ser Ser Gly Pro Val Pro Glu Thr Val Arg Lys Tyr 
730 735 740 

TTC CCT GAG ACA TGG ATC TGG GAT CTG GTG GTG GTG AAT TCC GCG GGT 2308 
Phe Pro Glu Thr Trp He Trp Asp Leu Val Val Val Asn Ser Ala Gly 
745 750 755 '<> u 

GTG GCT GAG GTA GGA GTA ACA GTC CCT GAC ACC ATC ACC GAG TGG AAG 2356 
Val Ala Glu Val Gly Val Thr Val Pro Asp Thr lie Thr Glu Trp Lys 
765 770 * 

GCA GGG GCC TTC TGC CTG TCT GAA GAT GCT GGA CTT GGT ATC TCT TCC 2404 
Ala Gly Ala Phe Cys Leu Ser Glu Asp Ala Gly Leu Gly lie Ser ber 
780 785 ' yu 

ACT GCC TCT CTC CGA GCC TTC CAG CCC TTC TTT GTG GAG CTC ACA ATG 2452 
Thr Ala Ser Leu Arg Ala Phe Gin Pro Phe Phe Val Glu Leu Thr Met 
795 " 800 805 

CCT TAC TCT GTG ATT CGT GGA GAG GCC TTC ACA CTC AAG GCC ACG GTC 2500 
Pro Tyr Ser Val He Arg Gly Glu Ala Phe Thr Leu Lys Ala Thr Val 
810 815 820 

CTA AAC TAC CTT CCC AAA TGC ATC CGG GTC AGT GTG CAG CTG GAA GCC 2548 
Leu Asn Tyr Leu Pro Lys Cys He Arg Val Ser Val~Gln Lea *Tu Ala 
825 830 835 840 

TCT CCC GCC TTC CTA GCT GTC CCA GTG GAG AAG GAA CAA GCG CCT CAC 2596 
Ser Pro Ala Phe Leu Ala Val Pro Val Glu Lys Glu Gin Ala Pro His 
845 850 8bi> 
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TGC ATC TGT GCA AAC GGG CGG CAA ACT GTG TCC TGG GCA GTA ACC CCA izaa 

Cys He Cys Ala Asn Gly Arg Gin Thr Val Ser lr P Ala Val Thr Pro 2644 
860 865 870 

AAG TCA TTA GGA AAT GTG AAT TTC ACT GTG AGC GCA GAG GCA PTA rar , £a , 

Lys Ser Leu Gly Asn Val Asn Phe Thr Val ?er Ala JnS AU 22 111 ^ 
8/5 880 885 

l CJ J M S? G CTG TGT GGG ACT GAG GTG CCT TCA GTT CCT GAA CAC GGA ?7drr 
Ser Gin Glu Leu Cys Gly Thr Glu Val Pro Ser Val Pro gVu ml ITy 



Jfq fJJ A?o ThJ SIf ?i C ^ G ? T CTG 116 GTT GAA CCT GAA GGA CTA 
Arg Lys Asp Thr Val lie Lys Pro Leu Leu Val Glu Pro Glu Gly Leu 

GlS fOc rtJ ?£ A ? AC I CC CTA CTT TGT CCA TCA GGT GGT GAG 
Glu Lys Glu Thr Thr Phe Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu 
925 930 935 

SaT III fitu Glu 1™ E f I G "? ? I G EE! GCA "7 GTG GTA GAA GAA 2884 



Tvr IIp Pho n « H* *V A F, AC ATT ACC CAA GCC CT C ATA TGG CTC 

1065 6 ASP ?I?« A1a His Ile Thr Gln Ala Leu He Trp Leu 

lwo ° 1070 1075 — 



2788 



2836 



Ser Glu Glu Leu Ser Leu Lys Leu Pro Pro Asn Val Val Glu Glu 
y4U 945 950 

III Ala Ara aTI P ?? ? A FI TTG GGA GAC ATA TTA GGC TCT GCC 
Ser Ala Arg Ala Ser Val Ser Val Leu Gly Asp He Leu Gly Ser Ala 

yo:> 960 ges 

25 ITu !£ ThS mU r f 11 f TC ^f G ATG CCC TAT GGC TGT GGA GAG 
Met Gin Asn Thr Gin Asn Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu 

975 gso 

PlS J?I S TG S T f P C T I T GCT CCT MC ATC TAT GTA CTG GAT TAT CTA 
Gin Asn Met Val Leu Phe Ala Pro Asn He Tyr Val Leu Asp Tyr Leu 

990 995 1000 

US G?2 ThJ n G m G CTT ^ T £ CA GAG ATC MG TCC *** GCC ATT GGC 
Asn Glu Thr Gin Gin Leu Thr Pro Glu Ile Lys Ser Lys Ala Ile Gly 

1005 1010 1015 

lyl til ten ill nl ! AC J? G AGA £ AG TTG MC TAC AAA CAC TAT GAT 
»yr Leu Asn Thr Gly Tyr Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp 

1020 1025 1030 

Gl5 III ?£ Spr ill IF r? G r AG « GA I AT GGC AGG AAC CAG GGC AAC 

iSk y G l-/ r9 Tyr Gly Ar 9 Asn Gin Gly Asn 

1035 1040 1045 

t£ 2S ThS aS ill FT CTG ^ JE T TTT GCC CAA GCT CGA GCC 3220 
1050 Phe ynL LeU Lys Thr Ph * AU-Gln-Ala-Arg Ala 

1U3U 1055 1060 

Hn 25 tlE £1 GAA . GCA CAC A JT ACC CAA GCC CTC ATA TGG CTC 

Leu 
1080 



2932 



2980 



3028 



3076 



3124 



3172 



3268 
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TCC CAG AGG CAG AAG GAC AAT GGC TGT TTC AGG AGC TCT GGG TCA CTG 3316 
Ser Gin Arg Gin Lys Asp Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu 
1085 1090 1U3:> 

CTC AAC AAT GCC ATA AAG GGA GGA GTA GAA GAT GAA GTG ACC CTC TCC 3364 
Leu Asn Asn Ala He Lys Gly Gly Val Glu Asp Glu Val Thr Leu Ser 
1100 1105 1110 

GCC TAT ATC ACC ATC GCC CTT CTG GAG ATT CCT CTC ACA GTC ACT CAC 3412 
Ala Tyr He Thr lie Ala Leu Leu Glu He Pro Leu Thr Val Thr His 
1115 1120 H25 

CCT GTT GTC CGC AAT GCC CTG TTT TGC CTG GAG TCA GCC TGG AAG ACA 3460 
Pro Val Val Arg Asn Ala Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr 
1130 1135 H40 

GCA CAA GAA GGG GAC CAT GGC AGC CAT GTA TAT ACC AAA GCA CTG CTG 3508 
Ala Gin Glu Gly Asp His Gly Ser His Val Tyr Thr Lys Ala Leu Leu 
1145 1150 H55 1160 

GCC TAT GCT TTT GCC CTG GCA GGT AAC CAG GAC AAG AGG AAG GAA GTA 3556 
Ala Tyr Ala Phe Ala Leu Ala Gly Asn Gin Asp Lys Arg Lys Glu Val 
1165 H70 1175 

CTC AAG TCA CTT AAT GAG GAA "GCT GTG AAG AAA GAC AAC TCT GTC CAT 3604 
Leu Lys Ser Leu Asn Glu Glu Ala Val Lys Lys Asp Asn Ser Val His 
1180 1185 H90 

TGG GAG CGC CCT CAG AAA CCC AAG GCA CCA GTG GGG CAT TTT TAC GAA 3652 
Trp Glu Arg Pro Gin Lys Pro Lys Ala Pro Val Gly His Phe Tyr Glu 
1195 1200 1205 

CCC CAG GCT CCC TCT GCT GAG GTG GAG ATG ACA TCC TAT GTG CTC CTC 3700 
Pro Gin Ala Pro Ser Ala Glu Val Glu Met Thr Ser Tyr Val Leu Leu 
1210 1215 1220 

GCT TAT CTC ACG GCC CAG CCA GCC CCA ACC TCG GAG GAC CTG ACC TCT 3748 
Ala Tyr Leu Thr Ala Gin Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser 
1225 1230 1235 1240 

GCA ACC AAC ATC GTG AAG TGG ATC ACG AAG CAG CAG AAT GCC CAG GGC 3796 
Ala Thr Asn He Val Lys Trp He Thr Lys Gin Gin Asn Ala Gin Gly 
1245 1250 1255 

GGT TTC TCC TCC ACC CAG CAC ACA GTG GTG GCT CTC CAT GCT CTG TCC 3844 
Gly Phe Ser Ser Thr Gin His Thr Val Val Ala Leu His Ala Leu Ser 
1260 1265 1270 

AAA TAT GGA GCA GCC ACA TTT ACC AGG ACT GGG AAG GCT GCA CAG GTG 3892 
Lys Tyr Gly Ala Ala Thr Phe Thr Arg Thr Gly Lys~Ala Ala Gin Val 
1275 1280 1285 

ACT ATC CAG TCT TCA GGG ACA TTT TCC AGC AAA TTC CAA GTG GAC AAC 3940 
Thr He Gin Ser Ser Gly Thr Phe Ser Ser Lys Phe Gin Val Asp Asn 
1290 1295 1300 
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AAC AAC CGC CTG TTA CTG CAG CAG GTC TCA TTG CCA GAG CTG tTT CCC 
Asn Asn Arg Leu Leu Let, Gin Gin Val Ser leu Pro G?S 25 Pro G?y 

1310 1315 1320 



1370 " 1375* i3 8 o 

GCC TCC AAC ATG GCG ATC GTT GAT GTG AAG ATG GTC TCT GGC TTC ATT 
Ala Ser Asn Met Ala lie Val Asp Val Lys Met Val Ser G?y Phe IU 

1390 1395 1400 

Pro 2S Us Pro ThJ SI? ? M £ TG f TT Jf* AGA TCT AAC CAT GTG AGC 
Leu Lys Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser 

1405 1410 



1415 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1484 amino acids 

(B) TYPE: amino acid 



3988 



4036 



GAA TAC AGC ATG AAA GTG ACA GGA GAA GGA TGT GTC TAC CTC CAG ArA 
Glu Tyr Ser Met Lys Val Thr Gly Glu Gly Cys Sal Tyr [11 ctn Jkr 
1325 1330 1335 

TCC TTG AAA TAC AAT ATT CTC CCA GAA AAG GAA GAG TTC CCC TTT GCT dww 
Ser Leu Lys Tyr Asn He Leu Pro Glu Lys Glu Glu Phe Pro Vhl AU 
1340 1345 1350 

Hu tu SI? Sff JF CTG GCT GAA ACT TGT GAT GAA CCC AAA GCC CAC 4132 
Leu Gly VaJ Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His 
1J:>:> 1360 1355 

S SS SK ffi S 25 SS S £ S & G S £ c S 4,80 



4228 



4276 



4324 



4372 



Ira ?h? SK SI? ? GC f C £ AT GK TTG ATT TAC CTT G AT AAG GTG 
Arg Thr Glu Val Ser Ser Asn His Val Leu He Tyr Leu Asp Lys Val 

1420 1425 . 1430 

ler 2£ G?n iff lZ f° IF £ C AGG GTT CTG ^AA GAT GTC CCA 
i]5c LeU Ser Leu Phe Phe Thr Val Leu Gin Asp Val Pro 
1,J:> 1440 1445 

GTA AGA GAT CTG AAA CCA GCC ATA GTG AAA GTC TAT GAT TAC TAC GAG 4420 
Val Arg Asp Leu Lys Pro Ala He Val Lys Val Tyr Asp ™ Tyr Glu 4420 

1455 i45o 

^r AsJ "til HI A°a ffl SP r?" J AC AAT GCT GCT TGC AGC AAA GAT 4468 
1465 JJfn Ala 61 U Tyr Asn Ala Pro c * s Se »" Lys Asp 

14/0 1475 1480 

iVu Gly JJ„" All TGAAGACCAC AA GGCTGAAA AGTGCTTTGC TGGAGTCCTG 4520 

TTCTCTGAGC TCCACAGAAG ACACGTGTTT TTGTATCTTT AAAGACTTGA TGAATAAACA 4580 
CTTTTTCTGG TCAAAAAAA " *" 



4599 
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(D) TOPOLOGY: linear 

(E) FEATURES: bait region: 690-740 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Gly Lys Asn Lys Leu Leu His Pro Ser Leu Val Leu Leu Leu Leu 
I 5 1° 10 

Val Leu Leu Pro Thr Asp Ala Ser Val Ser Gly Lys Pro Gin Tyr Met 
20 25 30 . 

Val Leu Val Pro Ser Leu Leu His Thr Glu Thr Thr Glu Lys Gly Cys 
35 40 45 

Val Leu Leu Ser Tyr Leu Asn Glu Thr Val Thr Val Ser Ala Ser Leu 
50 55 60 

Glu Ser Val Arg Gly Asn Arg Ser Leu Phe Thr Asp Leu Glu Ala Glu 
65 " 70 75 8" 

Asn Asp Val Leu His Cys Val Ala Phe Ala Val Pro Lys Ser Ser Ser 
85 9° y:> 

Asn Glu Glu Val Met Phe Leu" Thr Val Gin Val Lys Gly Pro Thr Gin 

ioo 105 n° 

Glu Phe Lys Lys Arg Thr Thr Val Met Val Lys Asn Glu Asp Ser Leu 
115 120 125 

Val Phe Val Gin Thr Asp Lys Ser He Tyr Lys Pro Gly Gin Thr Val 
130 135 140 

Lys Phe Arg Val Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu 
145 150 155 lbU 

Leu He Pro Leu Val Tyr He Gin Asp Pro Lys Gly Asn Arg lie Ala 
165 170 I 75 

Gin Trp Gin Ser Phe Gin Leu Glu Gly Gly Leu Lys Gin Phe Ser Phe 
180 185 I 90 

Pro Leu Ser Ser Glu Pro Phe Gin Gly Ser Tyr Lys Val Val Val Gin 
195 200 205 

Lys Lys Ser Gly Gly Arg Thr Glu His Pro Phe Thr Val Glu Glu Phe 
210 215 220 

Val Leu Pro Lys Phe Glu Val Gin Val Thr Val ProLys lie lie Thr 
225 " 230 235 

He Leu Glu Glu Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr 
245 250 255 

Gly Lys Pro Val Pro Gly His Val Thr Val Ser He Cys Arg Lys Tyr 
260 265 2/0 
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Ser Asp Ala Ser Asp Cys His Gly Glu Asp Ser Gin Ala Phe Cys Glu 
c/d ?an 



285 



Lys Phe Ser Gly Gin Leu Asn Ser His Gly Cys Phe Tyr Gin Gin Val 

" u 295 300 

Lys Thr Lys Val Phe Gin Leu Lys Arg Lys Glu Tyr Glu Met Lys Leu 

610 325 



320 



His Thr Glu Ala Gin He Gin Glu Glu Gly Thr Val Val Glu Leu Thr 
325 330 335 

Gly Arg Gin Ser Ser Glu He Thr Arg Thr He Thr Lys Leu Ser Phe 
J4U 345 350 

Val Lys Val Asp Ser His Phe Arg Gin Gly He Pro Phe Phe Gly Gin 
Jo:> 360 365 

Val Arg Leu Val Asp Gly Lys Gly Val Pro He Pro Asn Lys Val He 
° /u 375 380 

Phe He Arg Gly Asn Glu Ala Asn Tyr Tyr Ser Asn Ala Thr Thr Asp 

- 390 395 400 

Glu His Gly Leu Val Gin Phe Ser He Asn Thr Thr Asn Val Met Gly 
405 410 415 y 

Thr Ser Leu Thr Val Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr 
q£U 425 430 

Gly Tyr Gin Trp Val Ser Glu Glu His Glu Glu Ala His His Thr Ala 

440 445 

Tyr Leu Val Phe Ser Pro Ser Lys Ser Phe Val His Leu Glu Pro Met 
H3U 455 460 

Ser His Glu Leu Pro Cys Gly His Thr Gin Thr Val Gin Ala His Tyr 

470 475 480 

He Leu Asn Gly Gly Thr Leu Leu Gly Leu Lys Lys Leu Ser Phe Tyr 
485 490 495 3 

Tyr Leu He Met Ala Lys Gly Gly He Val Arg Thr Gly Thr His Gly 
500 505 510 

Leu Leu Val Lys Gin Glu Asp Met Lys Gly His Phe Ser He Ser He 
515 520 525 

Pro Val Lys Ser Asp He Ala Pro Val Ala Arg Leu Leu He Tyr Ala 

535 - — 540- — * - - * 

Val Leu Pro Thr Gly Asp Val lie Gly Asp Ser Ala Lys Tyr Asp Val 

550 555 560 

Glu Asn Cys Leu Ala Asn Lys Val Asp Leu Ser Phe Ser Pro Ser Gin 
555 570 575 
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Ser Leu Pro Ala Ser His Ala His Leu Arg Val Thr Ala Ala Pro Gin 
580 585 

Ser Val Cys Ala Leu Arg Ala Val Asp Gin Ser Val Leu Leu Met Lys 
59 5 

Pro Asp Ala Glu Leu Ser Ala Ser Ser Val Tyr Asn Leu Leu Pro Glu 
610 615 

Lys Asp Leu Thr Gly Phe Pro Gly Pro Leu Asn Asp Gin Asp Asp Glu 
625 630 635 

Asp Cys He Asn Arg His Asn Val Tyr lie Asn Gly He Thr Tyr Thr 
r 545 650 D " 

Pro Val Ser Ser Thr Asn Glu Lys Asp Met Tyr Ser Phe Leu Glu Asp 
660 665 &/u 

Met Gly Leu Lys Ala Phe Thr Asn Ser Lys He Arg Lys Pro Lys Met 
675 680 685 

Cys Pro Gin Leu Gin Ser Val Ser Ala Gly Ala Val Gly Gin Gly Tyr 
690 695 700 

Tyr Gly Ala Gly Leu Gly Val Val Glu Arg Pro Tyr Val Pro Gin Leu 
705 710 715 

Gly Thr Tyr Asn Val He Pro Leu Asn Asn Glu Gin Ser Ser Gly Pro 
725 730 /<3D 

- Val Pro Glu Thr Val Arg Lys Tyr Phe Pro Glu Thr Trp lie Trp Asp 
740 745 750 

Leu Val Val Val Asn Ser Ala Gly Val Ala Glu Val Gly Val Thr Val 
755 760 765 

Pro Asp Thr He Thr Glu Trp Lys Ala Gly Ala Phe Cys Leu Ser Glu 
770 775 780 

Asp Ala Gly Leu Gly He Ser Ser Thr Ala Ser Leu Arg Ala Phe Gin 
785 790 795 

Pro Phe Phe Val Glu Leu Thr Met Pro Tyr Ser Val He Arg Gly Glu 
805 810 81b 

Ala Phe Thr Leu Lys Ala Thr Val Leu Asn Tyr Leu Pro Lys Cys He 
820 825 830 

Arg Val Ser Val Gin Leu Glu Ala Ser Pro Ala Phe Leu Ala Val Pro 
835 840 

Val Glu Lys Glu Gin Ala Pro His Cys He Cys Ala Asn Gly Arg Gin 
850 855 860 

Thr Val Ser Trp Ala Val Thr Pro Lys Ser Leu Gly Asn Val Asn Phe 
865 870 875 ueu 



BNSDOCID: <WO 91 03557A 1 J_> 



qppi ACFMENT SHEET 



WO 91/03557 

PCT/DK90/00225 



57 



Thr Val Ser Ala Glu Ala Leu Glu Ser Gin Glu Leu Cys Gly Thr Glu 
885 890 895 

Val Pro Ser Val Pro Glu His Gly Arg Lys Asp Thr Val He Lys Pro 

yuu 905 — 



910 



Leu Leu Val Glu Pro Glu Gly Leu Glu Lys Glu Thr Thr Phe Asn Ser 

920 925 

Leu Leu Cys Pro Ser Gly Gly Glu Val Ser Glu Glu Leu Ser Leu Lys 

935 940 

Leu Pro Pro Asn Val Val Glu Glu Ser Ala Arg Ala Ser Val Ser Val 
* 950 955 960 

Leu Gly Asp He Leu Gly Ser Ala Met Gin Asn Thr Gin Asn Leu Leu 
965 970 975 

Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn Met Val Leu Phe Ala Pro 
980 985 99 0 

Asn He Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Gin Leu Thr Pro 
yyi > 1000 10 05 

Glu He Lys Ser Lys Ala He Gly Tyr Leu Asn Thr Gly Tyr Gin Arg 
1UiU 1015 1020 

GlnUu Asn Tyr Lys His Tyr Asp Gly Ser Tyr Ser Thr Phe Gly Glu 

1U,5U 1035 1040 

Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp Leu Thr Ala Phe Val Leu 
1045 1050 1055 

Lys Thr Phe Ala Gin Ala Arg Ala Tyr He Phe He Asp Glu Ala His 
i uqu J 065 



1070 



He Thr Gin Ala Leu He Trp Leu Ser Gin Arg Gin Lys Asp Asn Gly 
lu/:> 1080 1085 

Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys Gly Gly 
XU3U 1095 iioo 

Va^Glu Asp Glu Val Thr Leu Ser Ala Tyr He Thr He Ala Leu Leu 

1110 H15 U20 

Glu He Pro Leu Thr Val Thr His Pro Val Val Arg Asn Ala Leu Phe 
1125 n3o n35 

Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His Gly Ser 
11,0 H45- — H50. 

His Val Tyr Thr Lys Ala Leu Leu Ala Tyr Ala Phe Ala Leu Ala Gly 
1X00 1160 

Asn Gin Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu Glu Ala 

H75 H80 
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Val 
1185 



Lys Lys Asp Asn Ser Val His Trp Glu Arg^Pro Gin Lys Pro Lys 



1190 



1200 



Ala Pro Val Gly His Phe Tyr Glu Pro Gln^Ala Pro Ser Ala Glu^Val 



1205 



1210 



Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin Pro Ala 
1220 1225 1"U 

Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr Asn He Val Lys Trp He 
1235 1240 lc.10 

Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin His Thr 
1250 1255 1260 

Val Val Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr Phe Thr 



1265 



1270 



1275 



1280 



Arg Thr Gly Lys AlaAla Gin Val Thr IUGln Ser Ser Gly Thr^Phe 



1285 



1290 



Ser Ser Lys Phe Gin Val Asp Asn AsnAsn Arg Leu Leu Leu^Gln Gin 



1300 



1305 



Val Ser Leu Pro Glu Leu Pro" Gly Glu Tyr Ser Met Lys Val Thr Gly 
1315 1320 1325 

Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu Lys Tyr Asn He Leu Pro 
1330 1335 1340 

Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly Val^Gln Thr Leu Pro Glr^ 



1345 



1350 



1355 



Thr Cys Asp Glu Pro Lys Ala His Thr SerHie Gin He Ser Leu^Ser 



1365 



1370 



Val 



Ser Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met Ala Ile^Val Asp 



1380 



1385 



Val Lys Met Val Ser Gly Phe lie Pro Leu Lys Pro Thr^al Lys Met 



1395 



1400 



Leu Glu Arg Ser Asn His Val Ser Arg Thr Glu Val Ser Ser Asn His 
1410 1415 1420 

Val Leu He Tyr Leu Asp Lys Val Ser Asn Gin Thr Leu Ser Leu Phe 



1425 
Phe 



1430 



1435 



1440 



Thr Val Leu Gin Asp Val Pro Val Arg Asp Leu_Lys Pro ATa^le 



1445 



1450 



Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp Glu Phe Ala lie Ala Glu 
1460 I 465 1 

Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly Asn Ala 
1475 1480 
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PATENT CLAIMS 

1. A process for the production of recombinant a-macroglobul in- 

variants, fragments or derivatives thereof, wherein a functionally operative 
expression vector comprising a gene encoding for the expression of a- 
5 macroglobulin, variants, fragments or derivatives thereof, or alleles of 
such a gene, is introduced into a suitable host capable of expressing satt 
gene, said host is cultured in a suitable nutrient medium containing sources 
of assimilable carbon and nitrogen and other essential nutrients, and the 
^ expressed or-macroglobulin or fragments or derivatives thereof is recovered. 

2 The process of claim 1, wherein said gene encodes for the 

expression of human or 2 -macroglobu1 in, variants, fragments or derivatives 
thereof. 

15 3 " The process of cl aim 2, wherein said gene encodes for the 

expression of human ^-macroglobulin having the amino acid sequence of SEQ 
ID NO: 2, or a fragment or. derivative thereof. 

4- The process of claim 2 or 3, wherein said gene comprises the DNA 
sequence of SEQ ID N0:1, or a fragment thereof. 

5- The process of claim 1 or 2, wherein said gene encodes for a 
variant or-macroglobulin, in which the amino acid sequence of the bait region 
has been altered. 

6- The process of claim 5, wherein the bait region has been altered 
by incorporation of further proteinase target sites. 

OA 7 " The P roce " of claim 5, wherein the bait region has been altered 

30 by removal of proteinase target sites. 

8. The process of claim 5, wherein the bait region has been altered 

by replacing one or more specific proteinase target sites with one or more 

ninovi Ms AM .Tj. « 



20 



25 



35 



other specific proteinase target sites. 

9- The process of claim 8, wherein said proteinase target sites are 

specific for bovine trypsin, Streptomvces grisens trypsin, papain, porcine 
elastase, bovine chymosin, bovine chymotrypsin, Staphvlocorrn, stra in 
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V8 proteinase, human plasmin, bovine thrombin, thermolysin, subtil isin Novo 
and/or strgptomvces- oriseus proteinase B. 

10. The process of claim 5, wherein wherein the bait region has been 
5 altered by replacing said bait region or part thereof with a bait region or 

a part thereof from another a-macroglobulin. 

11. The process of claim 10, wherein said bait regions originate from 
human *M, Pregnancy Zone Protein (PZP), rat a,M, rat o^, rat aj, variant 

10 1, or rat <*,I 3 variant 2 (a,I 3 = a t -inhibitor 3), especially PZP. 

12. The process of any of claims 5 to 11, wherein said gene encodes 
for the expression of human a a 2 -macroglobul in variant having the amino acid 
sequence of SEQ ID N0:4, or a fragment or derivative thereof. 

15 

13. The process of any of claims 5 to 12, wherein said gene comprises 
the DNA sequence of SEQ ID N0:3, or a fragment thereof. 

14. The process of any of the claims 1 to 13, wherein said gene is 
20 a synthetic gene. 

15. The process of any of the claims 1 to 14, wherein said a- 
macroglobulin, variant, fragment or derivative thereof is co-expressed with 
a desired gene product. 

25 

16. The process of any of the claims 1 to 15, wherein said gene is, 
or is derived from, a human gene. 

17. The process of any of the claims 1 to 16, wherein said host is 
30 a bacterial strain, a fungal strain, a mammalian cell line, or a mammal. 



18. 



The process of claim 17, wherein said host is a fungus. 



19. The process of claim 18, wherein said fungus belongs to the genus 

35 Aspergillus . 



20 



The process of claim 18, wherein said host is a yeast. 
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21. The process of claim 20, wherein said yeast belongs to the genus 
Saccharomvces . 

22. The process of claim 17, wherein said host is a mammalian ceTl 
5 line. 



23. 



The process of claim 22, wherein said mammalian cell line i 



Syrian Baby Hamster Kidney (BKH) cell line. 



s a 



10 24 * The Process of claim 23, wherein said cell line is available from 

ATCC under No. CRL 1632. 

25. a DNA sequence comprising a gene encoding for the expression of 

an a-macroglobulin, variants, fragments or derivatives thereof 

15 

26 The DNA sequence of claim 25, wherein said gene encodes for human 

a 2 -macroglobul in. 

27. The DNA sequence of claim 25, wherein said gene encodes for the amino 
20 acid sequence of SEQ ID N0:2 or a fragment or derivative thereof. 

28. The DNA sequence of claim 26 or 27, wherein said gene has the 
nucleotide sequence of SEQ ID N0:1 or a fragment thereof. 

25 29> The DNA sequence of claim 25 or 26, wherein said gene encodes 

for a variant a-macroglobulin, in which the amino acid sequence of the bait 
region has been altered. 

30. The DNA sequence of claim 29, wherein said bait region has been 
30 altered by incorporation of further proteinase target sites. 

31. The DNA sequence of claim 29, wherein said bait region has been 
altered by removal of proteinase target sites. 

35 3Z> Tne DNA sequence of claim 29, wherein said bait region has been 

altered by replacing one or more specific proteinase target sites with one 
or more other specific proteinase target sites. 
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33. The DNA sequence of claim 29, wherein, wherein said proteinase 
target sites are specific for bovine trypsin, Strppt.omvces griseus trypsin, 
papain, porcine elastase, bovine chymosin, bovine chymotrypsin, Staphvlococ- _ 
cus aureus strain V8 proteinase, human plasmin, bovine thrombin, thermoly- 
sin, subtil isin Novo and/or Streptomvce s griseus proteinase B. 

34. The DNA sequence of claim 29, wherein the bait region has been 
altered by replacing said bait region or part thereof with" a bait region or 
a part thereof from another a-macroglobul in. 

35. The DNA sequence of claim 34, wherein said bait region originates 
from human a^, Pregnancy Zone Protein (PZP), rat a,M, rat o^, rat a,I 3 
variant 1, or rat a,I 3 variant 2, especially PZP. 

35. A functionally operative expression vector comprising a gene in 

accordance with any of the claims 25 to 35 for the expression of human a 2 - 
macroglobulin, variants, fragments or derivatives thereof, or alleles of 
such a gene. 

37. The vector of claim 36, further comprising regulatory elements 
necessary for the stable maintenance of said vector in mammalian cells. 

38. The vector of claim 36 or 37, further comprising sequences 
providing for the processing and secretion of the expressed product. 

39. The vector of any of the claims 36 to 38, further comprising one 
or more other genes encoding for a desired gene product. 



40. A functionally operative expression vector comprising a gene 
30 encoding for the expression of an a-macroglobul in, variants, fragments or 

derivatives thereof, or alleles of such a gene, essentially as described. 

41 . A transformed host compri sing a functionally operative expression 
vector comprising a gene encoding for the expression of human o^-macro- 

35 globulin or fragments or derivatives thereof, or alleles of such a gene. 

42. The host of claim 41, wherein said vector is the vector of any 
of the claims 36 to 40. 
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43. The host of claim 41 or 42, wherein said host is a bacterial 

strain, a fungal strain, a mammalian cell line, or a mammal. 



5 



44. The host of claim 43, wherein said host is a fungus. ~~ 

45. The host of claim 44, wherein said fungus belongs to the genus 
Aspergillus 

46. The host of claim 44, wherein said host is a yeast 

10 

47. The host of claim 46, wherein said host belongs to the genus Sac- 
charomvces . 

48. The host of claim 43, wherein said host is a mammalian cell line. 

1 5 

49. The host of claim 48, wherein said host is a Syrian Baby Hamster 
Kidney (BHK) cell line. 

50. The host of claim 49, wherein said cell line is available from 
20 ATCC under No. CRL 1632. 

51- Recombinant human a 2 -macroglobul in of SEQ ID NO:2 or SEQ ID N0:4 

in an active form. 

25 52. Recombinant a-macroglobul in, variants, fragments or derivatives 

thereof produced by a process of any of the claims 1 to 24. 

53. Recombinant a-macroglobul in, variants, fragments or derivatives 
thereof of claim 52 produced by the use of a vector of any of the claims 36 

30 to 40. 

54. Recombinant a-macroglobul in, variants, fragments or derivatives 
thereof essentially as described. 

35 55. Recombinant human a 2 -macroglobul in, variants, fragments or 

derivatives thereof essentially as described. 

56. A growth medium comprising one or more a-macroglobul ins. 
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57. A growth medium comprising recombinant a-macroglobulin, variants, 
fragments or derivatives thereof according to any of the claims 51 to 55. 

58. Use of recombinant a-macroglobulin, variants, fragments or 
5 derivatives thereof according to any of the claims 51 to 55 as a protein 

carrier in enzyme replacement therapy. 

59. use of recombinant a-macroglobulin, variants, fragments or 
derivatives thereof according to any of the claims 51 to 55 as a DNA carrier 

10 in gene therapy. 
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